Optimal COVID-19 therapeutic candidate discovery using the CANDO platform

Mangione, William; Falls, Zackary; Samudrala, Ram

doi:10.3389/fphar.2022.970494

ORIGINAL RESEARCH article

Front. Pharmacol., 25 August 2022

Sec. Drugs Outcomes Research and Policies

Volume 13 - 2022 | https://doi.org/10.3389/fphar.2022.970494

Optimal COVID-19 therapeutic candidate discovery using the CANDO platform

Department of Biomedical Informatics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, United States

Abstract

The worldwide outbreak of SARS-CoV-2 in early 2020 caused numerous deaths and unprecedented measures to control its spread. We employed our Computational Analysis of Novel Drug Opportunities (CANDO) multiscale therapeutic discovery, repurposing, and design platform to identify small molecule inhibitors of the virus to treat its resulting indication, COVID-19. Initially, few experimental studies existed on SARS-CoV-2, so we optimized our drug candidate prediction pipelines using results from two independent high-throughput screens against prevalent human coronaviruses. Ranked lists of candidate drugs were generated using our open source cando.py software based on viral protein inhibition and proteomic interaction similarity. For the former viral protein inhibition pipeline, we computed interaction scores between all compounds in the corresponding candidate library and eighteen SARS-CoV proteins using an interaction scoring protocol with extensive parameter optimization which was then applied to the SARS-CoV-2 proteome for prediction. For the latter similarity based pipeline, we computed interaction scores between all compounds and human protein structures in our libraries then used a consensus scoring approach to identify candidates with highly similar proteomic interaction signatures to multiple known anti-coronavirus actives. We published our ranked candidate lists at the very beginning of the COVID-19 pandemic. Since then, 51 of our 276 predictions have demonstrated anti-SARS-CoV-2 activity in published clinical and experimental studies. These results illustrate the ability of our platform to rapidly respond to emergent pathogens and provide greater evidence that treating compounds in a multitarget context more accurately describes their behavior in biological systems.

1 Introduction

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and the disease caused by its infection, COVID-19, was first documented in Wuhan, China in December 2019. It spread rapidly and was declared a pandemic by the World Health Organization in March 2020, causing over 5.9 million deaths across the world as of February 2022 (Organization, 2022). The scientific community immediately began employing various tools and methods to identify medical interventions that would reduce the threat posed by this novel coronavirus. Numerous institutions conducted clinical trials evaluating the ability of therapeutics to decrease COVID-19 lethality, often reporting conflicting results for the same drug (e.g. chloroquine and remdesivir) (Wang Y. et al., 2020; Chowdhury et al., 2020; Spinner et al., 2020). Few clearly conclusive success stories were reported in the months immediately following the outbreak with the most notable being dexamethasone, an anti-inflammatory corticosteroid that reduced death rates in patients suffering from a hyperactive immune system response known as a cytokine storm (Group, 2021). Further, it took nearly two years for a direct antiviral therapeutic indisputably capable of significantly preventing death from COVID-19 to be approved by the FDA, specifically both molnupiravir and the nirmatrelvir/ritonavir combination drugs in December of 2021 (Mahase, 2021; Hammond et al., 2022), which speaks to the complexity of this disease and the urgent need for innovative technologies that rapidly and effectively identify promising therapies. Such technologies will not only be useful in the present but also to combat any new emerging pathogens.

Significant advances made in the field of computational drug discovery were deployed in the context of COVID-19 with the goal of uncovering viable solutions (Mohamed et al., 2021). For example, multiple studies utilized virtual docking methods to identify compounds with strong affinity to SARS-CoV-2 proteins (Vijayan et al., 2020; Wang, 2020; Baby et al., 2021). Others used network-based bioinformatics methods to suggest drug repurposing candidates or better understand SARS-CoV-2 pathology, taking advantage of large scale human and virus protein-protein interaction knowledge (Zhou et al., 2020; Ghandikota et al., 2021; Gysi et al., 2021). On the clinical side, applications of traditional and deep machine learning methods have been utilized to identify high-risk patients, such as convolutional neural networks that analyze CT and X-ray images (Ardakani et al., 2020; Ozturk et al., 2020). Deep learning approaches have also been directly applied to identify drug candidates for treating COVID-19 (Liu et al., 2021; Pham et al., 2021).

In this study we describe and evaluate the performance of our Computational Analysis of Novel Drug Opportunities (CANDO) multiscale therapeutic drug discovery, repurposing, and design platform for identifying small molecules that show potential in inhibiting the SARS-CoV-2 virus and treating COVID-19. CANDO was originally designed as a shotgun repurposing platform for exactly this type of epidemic/pandemic scenario utilizing multiscale modeling techniques and adhering to multitarget drug theory, but has since been enhanced to carry out novel drug discovery against all indications (Jenwitheesuk and Samudrala, 2003b, 2005; Jenwitheesuk et al., 2008; Horst et al., 2012; Minie et al., 2014; Sethi et al., 2015; Chopra et al., 2016; Chopra and Samudrala, 2016; Falls et al., 2019; Fine et al., 2019; Mangione and Samudrala, 2019; Schuler et al., 2019; Schuler and Samudrala, 2019; Mangione et al., 2020b; Hudson and Samudrala, 2021; Schuler et al., 2021) as well as novel drug design (Overhoff et al., 2021). The relatively recent introduction of higher order biological data such as protein pathways, protein-protein interactions, drug side effects, and protein-disease associations has further augmented our ability to describe compound behavior holistically, with subsequent improved performance (Moukheiber et al., 2021; Schuler et al., 2021; Mangione, 2022; Mangione et al., 2022). Our platform is freely available to the scientific community and a detailed description of the software implementation has been published (Mangione et al., 2020a).

We employed two separate predictive pipelines within CANDO to suggest putative drug candidates for COVID-19: one first optimized our compound-protein interaction protocol against SARS-CoV and then applied it to SARS-CoV-2, and the other searched for compounds that were similar to those known to possess anti-coronavirus activity based on interactions computed with all human proteins. We originally published three different ranked lists of putative drug candidates in March and May of 2020 using the CANDO platform (Mangione et al., 2020b; Group, 2020). In May 2020, we published an assortment of drug candidates that were highly ranked by CANDO and were at the time being investigated in clinical trials to treat COVID-19. Since then several of our top scoring compounds have been validated by us and by others which we analyze in detail here. The significant number of top-ranked therapeutics successfully validated in this study, our previous work with the Ebola Virus Disease outbreak in West Africa in 2014 (Chopra et al., 2016), as well as our earlier validation studies and analyses (Jenwitheesuk and Samudrala, 2003b,a, 2005; Jenwitheesuk et al., 2008; Costin et al., 2010; Nicholson et al., 2011; Michael et al., 2011a,b), all suggest that CANDO is an effective tool to combat newly emerging epidemics and pandemics.

2 Results and discussion

Figure 1 illustrates the pipelines and protocols used within the CANDO platform to produce the three lists of drug candidates; a detailed description follows below.

FIGURE 1

2.1 Compound-protein interaction protocol parameter optimization

We initially assessed the robustness of predictions made by the CANDO platform by inspecting the recapture rate of small molecules identified to be active against SARS-CoV, MERS-CoV, and other coronavirus species from two high-throughput screens by Shen et al. and Dyall et al. (Dyall et al., 2014; Shen et al., 2019).

We parameterized our compound-protein interaction scoring protocol via the discounted cumulative gain metric after generating many matrices using various criteria (see Section 3.4). Figure 2 depicts how well each parameter set ranked the actives present in the three separate screens. Among the top four competitive parameter sets, two did not have any screens ranked within the top 10 and were discarded. The parameter set we chose to apply to SARS-CoV-2 ranked 25th for SARS-CoV, 3rd for HCoV-NL63, and 10th for HCoV-OC43. We selected this over the other competitive parameter set because omacetaxine mepesuccinate, one of the strongest actives identified in the Dyall screen, was ranked 2nd versus being ranked 14th in the discarded set. The final interaction scoring protocol and corresponding de novo candidate generation pipeline parameters included the integer based Extended-connectivity fingerprint (ECFP) with a diameter of 10, dCxP scoring protocol, and a compound-protein interaction score cutoff of 0.9 (see Section 3.3 and Section 3.4).

FIGURE 2

2.2 Generation and validation of drug candidates

We generated three lists of drug candidates from corresponding pipelines that mixed and matched the protocols and data sources as described in the methods: 1) Using the parameters identified in the previous step, we generated a list of 155 approved drug candidates with strong interaction scores to SARS-CoV-2 proteins where the top scoring compounds all had interaction scores greater than or equal to 0.9 to one or both of the main (Mpro) or papain-like (PLpro) proteases (identified as 3.5.20 de novo). 2) The nonredundant synthesis of the 18 actives from the Shen study and 21 actives from the Dyall study as well as 2 promising manually added candidates oseltamivir and remdesivir served as input to the interaction signature similarity pipeline since it does not require EC50 values. These 38 compounds were then used to generate 45 approved drug candidates using the signature similarity pipeline (3.5.20 similarity). 3) We later repeated the similarity pipeline with a sublibrary of 85 anti-SARS-CoV-2 actives and an enhanced CANDO compound library (v2.3) to generate a list of 97 approved drug candidates (5.18.20 similarity).

We scoured the literature to see if other studies validated our candidates from our three lists against SARS-CoV-2, primarily utilizing two different resources that collate detailed information on therapeutic interventions against COVID-19: CoronaCentral and the Targeting COVID-19 Portal from the Global Health Drug Discovery Institute (GHDDI) (see Section 3.6). Table 1 gives a summary of the number of predicted candidates and validations, along with correlation coefficients and discounted cumulative gain scores. Table 2 gives a full breakdown of the validations from each list as well as two drugs with weak EC50s not counted as validated: moxifloxacin and levofloxacin. This includes full virus, main protease, other miscellaneous in vitro (for example, inhibition of SARS-CoV-2 spike protein binding to the human ACE2 receptor), and electronic health record (EHR) studies. The studies demonstrating the activities are provided in Supplementary Table S1 while the energetic stability of the designated hits are provided in Supplementary Table S2. Figure 3 uses a Sankey diagram to illustrate the validation of all candidates with EC50s less than 10μM, which includes 31 drugs that were found to be effective against SARS-CoV-2 in full virus inhibition studies. Overall, a total of 51 drugs showed efficacy against SARS-CoV-2 out of 275 nonredundant candidates for a hit rate of 18.5%.

TABLE 1

	Total	Viable	Approved	Checked	Validated	Hit rate	CC	DCG
3.5.20 de novo	225	224	155	48	21	13.5%	0.41	0.96
3.5.20 similarity	115	114	45	17	11	24.4%	0.63	0.24
5.18.20 similarity	100	97	97	48	29	29.9%	0.35	0.22
Combined	440	435	297	113	61	20.5%	0.30	—
Nonredundant	419	414	275	102	51	18.5%	0.37	—

Summary details of drug candidates generated by the CANDO platform. For each candidate list, the total number of candidates that were initially generated by our prediction modules, the number of viable candidates after manual filtering (removing ions and dyes) prior to validation, the number of approved compounds, the number of candidates that were matched via literature search using the CoronaCentral and GHDDI resources (“Checked”), the number of candidates with EHR evidence or in vitro activity less than 100 μM (“Validated”), the hit rate percentage, the Pearson correlation coefficient (“CC”) between the full virus validation ranks and their EC50 scores (including the combined and nonredundant lists), and the discounted cumulative gain (“DCG”) score are given. Overall, we obtained hit rates ranging from 13.5 to 29.9% using the CANDO platform, with the signature similarity pipelines yielding the highest success rates and the direct viral inhibition de novo pipeline accurately ranking the best, most potent, candidates.

TABLE 2

Compound	3.5.20	3.5.20	5.18.20	SARS-CoV-2	Mpro IC50	Other
Compound	de novo	similarity	similarity	EC50 (μM)	(μM)	Other
Omacetaxine mepesuccinate	1	—	—	0.03	—	—
Chlorpromazine		3	11	3.14	—	—
Clomipramine		4	—	5.63	—	—
Entrectinib	—	—	4		—	58.4 μM IC50 Spike protein binding ACE2
Mycophenolate mofetil	7	—	—	0.87	—	—
Imipramine	127	8	—	10.0	—	—
Toremifene	—	—	8	2.5	—	—
Tamsulosin	100	14	38		—	18% relative risk reduction (death)
Bepridil	15	—	—	0.86	72	—
Azelastine	—	—	15	2.24	—	—
Zuclopenthixol	—	28	18	1.35	—	—
Masitinib	—	20	50	3.2	—	—
Erythromycin	—	—	20	—	—	70% reduction SARS-2 infection at 100ug/ml
Chloroquine	—	21	96	7.28	—	—
Ritonavir	—	—	21		13.7	—
Hydroxychloroquine	—	22		4.14	—	—
Cobicistat	—	—	22		6.7	—
Amodiaquine	—	23	40	0.13	—	—
Nilotinib	—	26	—	1.88	—	4.21 μM IC50 Spike protein binding ACE2
Pimozide	—	—	26		42	—
Diphenhydramine	28	—	—	17.4	—	—
Clomifene	—	29	84	9.73	—	—
Remdesivir	30	—	—	0.76	—	—
Butenafine	—	—	35	—	5.4	—
Moxifloxacin	—	44	—	239.7	—	—
Clarithromycin	—	—	47	—	—	78% reduction in severe respiratory failure versus chloroquine
Saquinavir	—	—	54	—	9.92	—
Simeprevir	—	—	55	2.3	48.2	—
Ouabain	—	—	56	0.024	—	—
Azithromycin	—	—	57	2.12	—	—
Tranylcypromine	57	—	—	—	8.64	—
Almitrine	—	—	68	1.42	—	—
Tamoxifen	—	—	74	8.98	—	—
Colistimethate	—	—	75	—	—	Mpro 17% bound (50 μM)
Lopinavir	—	—	76	9.12	—	—
Terconazole	144	—	78	11.92	—	—
Silodosin	81	—	—	—	—	18% relative risk reduction (death)
Atazanavir	—	—	82	0.22	60.7	—
Triamterene	86	—	—	—	—	23.5 μM IC50 Spike protein binding ACE2
Hydroxyzine	90	—	—	15.3	—	0.42 hazard ratio (death)
Itraconazole	—	—	90	0.39	—	—
Ebastine	—	—	92	0.5	57	—
Avatrombopag	—	—	95	5.71	—	—
Trimipramine	99	—	—	1.5	—	—
Flunarizine	105	—	—	19.05	—	—
Tadalafil	108	—	—	—	—	100 μM IC50 preventing Spike protein binding to ACE2
Thalidomide	109	—	—	—	—	11 versus 23 median days SARS-CoV-2 negative conversion from admission, 18.5 vs. 30 days length hospital stay
Paroxetine	111	—	—	—	—	0.52 hazard ratio (death or intubation)
Ifenprodil	117	—	—	—	46.86	Mpro 39% bound (50 μM)
Nebivolol	123	—	—	2.72	—	—
Doxazosin	133	—	—	—	—	74% relative risk reduction (death)
Levofloxacin	145	—	—	418.6	—	—
Teniposide	149	—	—	—	—	46.3 μM IC50 Spike protein binding ACE2

Complete list of validated candidates generated by the CANDO platform. The names of the 51 compounds, their ranks in the 3.5.20 de novo, 3.5.20 similarity, and 5.18.20 similarity lists, the full virus EC50s, main protease IC50s, and EHR-based evidence are given. Only the lowest full virus EC50 for each candidate is shown. The de novo pipeline identified better, more potent, full virus inhibition candidates, while the signature similarity pipeline identified a greater fraction of validated candidates accurately.

FIGURE 3

In addition to these validations gathered from the literature, 30 candidates were evaluated by our collaborator, Ennaid Therapeutics, of which 11 displayed in vitro efficacy; a patent has been filed for their use (Samudrala et al., 2020).

Aside from moxifloxacin and diphenhydramine, all validations of candidates ranked in the top 50 of their respective lists have full virus EC50 values less than 10 μM. The same is true for those in the top 100 with the exception of hydroxyzine and terconazole. The second strongest reported EC50 (0.03 μM) was obtained using omacetaxine mepesuccinate, the top ranked candidate from the 3.5.20 de novo list, which is only slightly weaker than the best EC50 belonging to ouabain (0.024 μM), ranked 56 in the 5.18.20 similarity list. Figure 4 illustrates the proposed mechanism of omacetaxine mepesuccinate inhibiting SARS-CoV-2 via strong predicted interactions to the main and papain-like proteases. Two other drugs known to inhibit both SARS-CoV-2 proper as well as its main protease, bepridil and ebastine, were present in the 3.5.20 de novo and 5.18.20 similarity lists respectively, with the latter having a relatively weak interaction score to the main protease of 0.82 while the former received a score of 0.98. However, the protease inhibition activity of ebastine is supported by it being the third most similar compound to nelfinavir, a known human immunodeficiency virus protease inhibitor, based on their proteomic interaction signature similarity, suggesting the CANDO platform is capable of recognizing/predicting mechanistic behavior in multiple ways.

FIGURE 4

We also investigated why moxifloxacin was deemed a candidate despite its low reported efficacy (Figure 5). Moxifloxacin was predicted by the 3.5.20 similarity pipeline and received a score of two meaning it was in the top 25 most similar compounds to two coronavirus actives (average rank 19.5). Moxifloxacin was the 18th most similar compound to mefloquine and the 21st most similar to emetine; the former is a treatment for malaria, similar to many other anti-malarials with moderate activity (∼4–15 μM) against coronaviruses in vitro (Dyall et al., 2014; Ellinger et al., 2021), and the latter is an experimental treatment for amoebiasis with demonstrated activity against not only SARS-CoV-2 (EC50 0.46 μM) (Choy et al., 2020), but many other coronavirus species (Dyall et al., 2014; Shen et al., 2019). Moxifloxacin having similarity to one strong and one moderate anti-coronavirus compound would suggest a stronger EC50 than 239.7 μM; we attribute this result to a progressive decrease in behavioral/functional similarity signal strength/relevance as the distance between their proteomic interaction signatures relative to those of known coronavirus actives increases. In other words, the signal disappears as we move further down the ranks as depicted in Figure 5.

FIGURE 5

The second to last validation in the 3.5.20 similarity list is clomifene, an infertility treatment in women, at rank 29 with a score of 2 and EC50 of 9.73 μM; it is similar to the coronavirus active compounds tamoxifen (rank 2) and toremifene (rank 11), constituting an average rank of 6.5. Additionally, all other validations from the same list have an average rank of less than or equal to 6.5 regardless of the score, which ranges from two to six. This implies setting the cutoff rank for the canpredict module to a lower value will produce stronger candidates and is further supported by the higher hit rate observed in the 5.18.20 similarity list (29.9 vs. 24.4% for 3.5.20 similarity) which was produced with a cutoff of ten. However the candidates predicted in the 5.18.20 similarity list benefited from using anti-SARS-CoV-2 drugs specifically, as opposed to actives against other coronavirus species, and had over double the number of active compounds when compared to the actives used to generate the 3.5.20 similarity list.

The candidates generated using the human proteome interaction signature similarity pipeline had higher validation rates relative to the direct compound-protein inhibition de novo pipeline; yet some of the candidates generated by the latter demonstrated stronger in vitro efficacy. The increase in hit rate is due to the similarity pipeline utilizing the structural knowledge embedded in the results of countless coronavirus studies, whereas the de novo pipeline relies exclusively on the fidelity of the compound-protein interactions computed using our interaction scoring protocols, which are prone to inaccuracies. The de novo pipeline was better tuned to correctly rank the strong inhibitors as interaction scoring parameters were first optimized for SARS-CoV using the discounted cumulative gain metric, which prioritizes ranking the strongest active compounds near the top of the list. This suggests that weighting the active compounds based on their available EC50 values for the full proteome interaction similarity pipeline may produce more potent candidates.

Our observed hit rate of 18.5% is likely conservative as not all of the compounds from the three candidate lists have been validated for efficacy against SARS-CoV-2 in published clinical and experimental studies. Conversely, the fraction of these 51 validations analyzed in this study that will result in clinical utility is limited due to a variety of factors such as pharmacokinetics, pharmacodynamics, safety, and cost. Multiple candidates that we listed as validations, specifically chloroquine, hydroxychloroquine, and azithromycin, have had conflicting reports of clinical benefit (Wang Y. et al., 2020; Chowdhury et al., 2020; Spinner et al., 2020; Echeverría-Esnal et al., 2021); regardless, we consider them a successful prediction of the CANDO platform due to the extensive number of in vitro studies reporting their SARS-CoV-2 inhibition, which is what the compound-proteome interaction analytics pipelines present in CANDO optimize for at present. Furthermore, even if CANDO fails to accurately score a known interaction with our bioanalytic docking protocol (BANDOCK) for a compound with reported activity, as in the case of ebastine and the SARS-CoV-2 main protease, its therapeutic mechanism may still be elucidated by inspecting the behavior of highly similar compounds based on their proteomic interaction signatures. Consequently, we are actively implementing methods to further refine the feasibility of our candidates based on the aforementioned factors.

3 Methods

3.1 Compound structure library and known actives curation

The CANDO v2.1 compound library consisted of 8,696 drug and drug-like small molecule three-dimensional structures, including 1,979 approved for human use, and was extracted from DrugBank (Wishart et al., 2018); this library was used for the initial predictions. We later updated the CANDO compound library to v2.3 that included 13,194 compounds from DrugBank consisting of 2,449 approved drugs and 2,519 small molecule metabolites, with the remaining classified as experimental/investigational. Biologic therapeutics were not included in our analyses.

Initially, compounds were considered as a coronavirus active if they were identified in one of two high-throughput screens by Shen et al. and Dyall et al. (Dyall et al., 2014; Shen et al., 2019). The former screened a library of 290 compounds against SARS-CoV and Middle East respiratory syndrome coronavirus (MERS-CoV). The latter screened a 2,000 compound library against four different coronavirus strains: human coronavirus OC43 (HCoV-OC43), human coronavirus NL63 (HCoV-NL63), MERS-CoV, and murine coronavirus (MHV-A59; also known as mouse hepatitis virus). Out of 60 successful hits from both studies, 18 compounds from the Shen study along with their EC50s against HCoV-OC43 and HCoV-NL63, as well as 12 compounds from the Dyall study and their EC50s against SARS-CoV were mapped to our compound library. These three actives sublibraries were used for the compound-protein interaction scoring protocol parameter optimization (see Section 3.4).

The nonredundant combination of actives in the Shen and Dyall studies were used for the signature similarity candidate generation pipeline (see Section 3.5). We also added oseltamivir and remdesivir as at that time (February 2020) evidence suggested that they may inhibit SARS-CoV-2 or related coronaviruses (Wang M. et al., 2020; Coenen et al., 2020), resulting in an actives library of 38 compounds.

As more data became available regarding in vitro efficacy values for compounds against SARS-CoV-2, a second sublibrary of 85 actives with reported EC50 values less than or equal to 10 μM was extracted on May 7, 2020 from the Targeting COVID-19 Portal from GHDDI (Leng, 2020), which contained 17/38 compounds from the previous list. The updated CANDO compound library along with the new GHDDI actives sublibrary were used for the enhanced signature similarity candidate generation pipeline (see Section 3.5).

3.2 Protein structure library curation

The available SARS-CoV x-ray diffraction protein structures were obtained from the Protein Data Bank (PDB) (Burley et al., 2019) and initially served as our representative coronavirus proteome, comprising eighteen total structures. These eighteen SARS-CoV proteins were used for the compound-protein interaction protocol optimization (see Section 3.3).

A SARS-CoV-2 protein library of 24 structures was modeled from sequence using the I-TASSER v5.1 suite (Yang et al., 2015) and comprised the proteome used for the remaining analyses. We prioritized 18/24 proteins that were modeled by I-TASSER using homology to known coronavirus structures. These 18 SARS-CoV-2 proteins were used for the de novo pipeline, while both iterations of the signature similarity based pipeline (see Section 3.5) used a library of 5,317 human protein x-ray diffraction structures extracted from the PDB. The former piepline is implemented using the canpredict de novo module, and the latter is implemented using the canpredict similarity module, in the cando.py Python package (Mangione et al., 2020a; Mangione and Falls, 2022)).

3.3 Compound-protein interaction calculation

We utilized our in-house bioinformatic analytics-based docking protocol BANDOCK to generate interaction scores between every compound and every protein structure; these scores serve as a proxy for binding strength/probability (Minie et al., 2014; Sethi et al., 2015; Falls et al., 2019; Hudson and Samudrala, 2021). The COACH algorithm from the I-TASSER suite (Yang et al., 2013) was used to predict binding sites for each protein. COACH outputs an associated score and binding ligand for every binding site in a protein and is the primary data used by BANDOCK to generate interaction scores. For a given compound and protein pair, every interacting ligand predicted by COACH is compared to the query compound by computing the similarity coefficient of their chemical fingerprints generated via RDKit (Landrum, 2013). The maximum resulting coefficient (i.e. the strongest match) and its associated binding site score are then used to compute the final interaction score for the compound-protein pair, depending on the scoring protocol parameters. This is repeated iteratively for each protein in a given library (e.g. SARS-CoV, SARS-CoV-2, human, nonredundant PDB), resulting in a proteomic interaction signature for every drug/compound, represented an N × M matrix, where N is the number of drugs/compounds and M is the number of proteins.

Interaction scoring (BANDOCK) parameters were systematically varied to identify those optimal for assessing anti-coronavirus activity. These were 1) the chemical fingerprinting method: ECFP or functional-class fingerprint (FCFP) with diameters of 0, 2, 4, 6, 8, and 10 and length of 2048; 2) the fingerprint style: binary vs integer based for the compounds/ligands; 3) the scoring protocol: the binding site score from COACH (Pscore), the Tanimoto or Sorenson-Dice coefficient of the binding site ligand from COACH to the query drug (Cscore) for binary or integer fingerprints, respectively, the percentile of the Cscore in the distribution of all I-TASSER ligand Cscores to the query drug (dCscore), or products of these (Pscore × Cscore, Pscore × dCscore); and 4) thresholds: Pscore and Cscore (or dCscore) thresholds so that any binding site or compound-ligand similarity coefficient (or its percentile) that does not exceed each cutoff, respectively, are ignored. A compound-protein interaction matrix was generated for each of these parameter combinations.

Computed interaction scores with the 18 SARS-CoV proteins were used for compound-protein scoring protocol parameter optimization, while the scores computed (using the parameters identified in the previous step) with the 18 SARS-CoV-2 proteins were used for the de novo candidate generation pipeline. The scores computed with a library of 5,317 human PDB structures were used for the similarity-based pipelines (see section 3.5). The initial parameters were an ECFP4 binary fingerprint with Tanimoto coefficients for Cscores, Pscore scoring protocol, and a dCscore threshold of 0.5 (50th percentile), which were used to generate the March 5 2020 aka 3.5.20 list of candidates. The enhanced parameters were an ECFP4 integer fingerprint with Sorenson-Dice coefficient for Cscores, Pscore × dCscore scoring protocol, and a dCscore threshold of 0.75 (75th percentile), which were used to generate the May 18, 2020 aka 3.18.20 candidate list.

3.4 Parameter optimization using coronavirus active compound recovery

We identified the best parameters for BANDOCK that optimally ranked the compounds identified via high throughput screens against three different coronavirus species (SARS-CoV, HCoV-NL63, and HCoV-OC43), each of which were assessed separately via de novo drug candidate generation. We also varied the cutoff threshold of interaction scores to consider so that the interaction scores with proteins below that threshold were not considered in the total for a given compound. The cutoffs in this study were incremented by 0.05, starting with 0.0 (no threshold) and ending with 1.0 (maximum score). The discounted cumulative gain metric (Järvelin and Kekäläinen, 2002; Dupret, 2011), often employed for search engine optimization and other early recognition problems, was used to assess how well each matrix properly ranked the active compounds in the proper order given their associated EC50/IC50 values from each of the three species separately. Our previous work has identified this metric as the optimal one for drug repurposing studies (Schuler et al., 2021). Briefly, discounted cumulative gain (DCG) rewards lists of predictions that rank the optimal known actives at the top and progressively penalizes lower ranked ones via the equation:

where p is the length of the list, i is the rank, and rel_i is the relevance score of the item at position/rank i which is the inverse of the EC50 values (1/EC50) for the 36 nonredundant actives.

Parameter sets utilizing any of the following criteria were discarded due to trivial candidate rankings: Pscore scoring protocol, interaction score threshold of 1.0, and Cscore threshold of 1.0. Interaction scores generated using the Pscore protocol did not utilize the chemical fingerprint similarity value between the binding site ligand and the query compound and subsequently failed to discriminate between two compounds that used the same ligand. Using an interaction score or Cscore threshold of 1.0 required the chemical fingerprint similarity score to equal 1.0, meaning identical compounds, therefore ensuring the only predicted candidates were known coronavirus inhibitors.

3.5 COVID-19 drug candidate generation

To generate drug candidates against COVID-19, we used both a de novo pipeline that ranked compounds based on their predicted interaction scores against proteins from SARS-CoV-2, and a similarity pipeline that searched the CANDO drug/compound library for compounds similar to those deemed as actives in terms of their interaction signatures. The former protocol summed the computed interaction scores of each compound against all viral proteins and ranked them from best to worst. Interaction scores below particular thresholds were ignored in the sums (see section 3.4). For the initial iteration of the latter similarity protocol, drug candidates were ranked by their frequency of occurrence in the top 25 most similar compounds to each of the 38 coronavirus actives, while the enhanced iteration ranked compounds by frequency of occurrence in the top 10 most similar compounds to the 85 GHDDI actives. We kept track of the number of coronavirus actives each compound was similar to within the cutoff threshold along with their average ranks (which served as a tie-breaker) to produce the final ranked list of candidates.

The outputs of our pipelines were three ranked lists of drug candidates: one using the direct viral inhibition pipeline from the initial iteration (3.5.20 de novo), a second using the similarity based candidate generation pipeline from the initial iteration (3.5.20 similarity), and the third using the similarity based pipeline using the enhanced actives list (5.18.20 similarity).

3.6 External validation studies curation

We analyzed GHDDI (Leng, 2020) and CoronaCentral (Lever and Altman, 2021) for up-to-date information on COVID-19 therapeutic interventions which could independently and prospectively validate our top ranked candidates. Both sources utilize deep learning or natural language processing methods to automatically extract and annotate information from SARS-CoV-2 studies to produce lists of possible actives. We manually parsed the manuscripts that were annotated with and matched the name of any candidate compounds from our three prediction lists for corresponding efficacy values (EC50, IC50, hazard ratios, etc) while eliminating studies that were purely computational or did not investigate the candidate compound as the primary intervention.

4 Conclusion

This study highlights how CANDO may be used to rapidly generate promising leads for drug development when time is critical, provided the therapeutic intervention is possible within established dosing guidelines. Our study is an assessment of potential therapeutics for treating COVID-19 which were all generated within three months of the pandemic declaration by the WHO. Considering that it took almost one year for a vaccine (Food and Administration, 2022) and two years for a potent antiviral such as molnupiravir or nirmatrelvir (Mahase, 2021; Hammond et al., 2022) to become available, we have exemplified that computational drug discovery and repurposing platforms like ours can be strategically used to alleviate the burden of emergent pathogens ahead of time. Additional studies, ideally via in vivo and/or clinical studies, verifying the efficacy of these identified candidates is necessary in most cases, however for already approved drug candidates such as those explored in this study the need for trials demonstrating safety is greatly diminished. Additionally, retrospective EHR analysis may also be used to indirectly examine clinical benefits in human patients as in the case of fluoxetine (Oskotsky et al., 2021).

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: http://compbio.buffalo.edu/data/mc_cando_covid19/.

Author contributions

WM conceived the prediction pipelines, research design, approach and methods, conducted all experiments and analysis, implemented all pipelines, and drafted the manuscript. ZF helped with data generation, research design, approach, and methods, and editing the manuscript. RS conceived the prediction pipelines, research design, approach and methods, edited the manuscript, and supervised the overall project. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by a NIH Director’s Pioneer Award (DP1OD006779), a NIH Clinical and Translational Sciences Award (UL1TR001412), NIH T15 Award (T15LM012495), an NCATS ASPIRE Design Challenge Award, an NCATS ASPIRE Reduction-to-Practice Award, and startup funds from the Department of Biomedical Informatics at the University at Buffalo.

Acknowledgments

The authors would like to acknowledge the support provided by the Center for Computational Research at the University at Buffalo. We would also like to thank all members of the Samudrala Computational Biology Group.

Conflict of interest

A patent has been filled with the United States Patent and Trademark Office (USPTO Application number: 63/120,633) claiming some of the small molecule compounds identified using the approach discussed in this manuscript for the treatment of COVID-19, and were validated by Ennaid Therapeutics, LLC in a propreitary study. The compounds exclusive to the patent are not included in the list of the 51 validated actives and are not discussed further in the manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphar.2022.970494/full#supplementary-material

References

1
ArdakaniA. A.KanafiA. R.AcharyaU. R.KhademN.MohammadiA. (2020). Application of deep learning technique to manage Covid-19 in routine clinical practice using ct images: Results of 10 convolutional neural networks. Comput. Biol. Med.121, 103795. 10.1016/j.compbiomed.2020.103795
- CrossRef
- Google Scholar
2
BabyK.MaityS.MehtaC. H.SureshA.NayakU. Y.NayakY.et al (2021). Targeting sars-cov-2 main protease: A computational drug repurposing study. Arch. Med. Res.52, 38–47. 10.1016/j.arcmed.2020.09.013
- CrossRef
- Google Scholar
3
BurleyS. K.BermanH. M.BhikadiyaC.BiC.ChenL.Di CostanzoL.et al (2019). Rcsb protein Data Bank: Biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy. Nucleic Acids Res.47, D464–D474. 10.1093/nar/gky1004
- CrossRef
- Google Scholar
4
ChopraG.KaushikS.ElkinP. L.SamudralaR. (2016). Combating ebola with repurposed therapeutics using the cando platform. Molecules21, 1537. 10.3390/molecules21121537
- CrossRef
- Google Scholar
5
ChopraG.SamudralaR. (2016). Exploring polypharmacology in drug discovery and repurposing using the cando platform. Curr. Pharm. Des.22, 3109–3123. 10.2174/1381612822666160325121943
- CrossRef
- Google Scholar
6
ChowdhuryM. S.RathodJ.GernsheimerJ. (2020). A rapid systematic review of clinical trials utilizing chloroquine and hydroxychloroquine as a treatment for Covid-19. Acad. Emerg. Med.27, 493–504. 10.1111/acem.14005
- CrossRef
- Google Scholar
7
ChoyK.-T.WongA. Y.-L.KaewpreedeeP.SiaS. F.ChenD.HuiK. P. Y.et al (2020). Remdesivir, lopinavir, emetine, and homoharringtonine inhibit sars-cov-2 replication in vitro. Antivir. Res.178, 104786. 10.1016/j.antiviral.2020.104786
- CrossRef
- Google Scholar
8
CoenenS.van Der VeldenA. W.CianciD.GoossensH.BongardE.SavilleB. R.et al (2020). Oseltamivir for coronavirus illness: Post-hoc exploratory analysis of an open-label, pragmatic, randomised controlled trial in European primary care from 2016 to 2018. Br. J. Gen. Pract.70, e444–e449. 10.3399/bjgp20X711941
- CrossRef
- Google Scholar
9
CostinJ. M.JenwitheesukE.LokS.-M.HunspergerE.ConradsK. A.FontaineK. A.et al (2010). Structural optimization and de novo design of dengue virus entry inhibitory peptides. PLoS Negl. Trop. Dis.4, e721. 10.1371/journal.pntd.0000721
- CrossRef
- Google Scholar
10
DupretG. (2011). Discounted cumulative gain and user decision models. International Symposium on String Processing and Information Retrieval. Springer, 2–13.
- Google Scholar
11
DyallJ.ColemanC. M.VenkataramanT.HolbrookM. R.KindrachukJ.JohnsonR. F.et al (2014). Repurposing of clinically developed drugs for treatment of Middle East respiratory syndrome coronavirus infection. Antimicrob. Agents Chemother.58, 4885–4893. 10.1128/AAC.03036-14
- CrossRef
- Google Scholar
12
Echeverría-EsnalD.Martin-OntiyueloC.Navarrete-RoucoM. E.De-Antonio CuscoM.FerrándezO.HorcajadaJ. P.et al (2021). Azithromycin in the treatment of Covid-19: A review. Expert Rev. anti. Infect. Ther.19, 147–163. 10.1080/14787210.2020.1813024
- CrossRef
- Google Scholar
13
EllingerB.BojkovaD.ZalianiA.CinatlJ.ClaussenC.WesthausS.et al (2021). A sars-cov-2 cytopathicity dataset generated by high-content screening of a large drug repurposing collection. Sci. Data8, 70. 10.1038/s41597-021-00848-4
- CrossRef
- Google Scholar
14
FallsZ.MangioneW.SchulerJ.SamudralaR. (2019). Exploration of interaction scoring criteria in the cando platform. BMC Res. Notes12, 318. 10.1186/s13104-019-4356-3
- CrossRef
- Google Scholar
15
FineJ.LacknerR.SamudralaR.ChopraG. (2019). Computational chemoproteomics to understand the role of selected psychoactives in treating mental health indications. Sci. Rep.9, 13155. 10.1038/s41598-019-49515-0
- CrossRef
- Google Scholar
16
Food and Administration (2022). Comirnaty and pfizer-biontech covid-19 vaccine.
- Google Scholar
17
GhandikotaS.SharmaM.JeggaA. G. (2021). Secondary analysis of transcriptomes of sars-cov-2 infection models to characterize Covid-19. Patterns2, 100247. 10.1016/j.patter.2021.100247
- CrossRef
- Google Scholar
18
GroupR. C. (2021). Dexamethasone in hospitalized patients with Covid-19. N. Engl. J. Med. Overseas. Ed.384, 693–704. 10.1056/nejmoa2021436
- CrossRef
- Google Scholar
19
Group (2020). Cando platform putative drug candidates against covid-19.
- Google Scholar
20
GysiD. M.Do ValleI.ZitnikM.AmeliA.GanX.VarolO.et al (2021). Network medicine framework for identifying drug-repurposing opportunities for Covid-19. Proc. Natl. Acad. Sci. U. S. A.118, e2025581118. 10.1073/pnas.2025581118
- CrossRef
- Google Scholar
21
HammondJ.Leister-TebbeH.GardnerA.AbreuP.BaoW.WisemandleW.et al (2022). Oral nirmatrelvir for high-risk, nonhospitalized adults with Covid-19. N. Engl. J. Med. Overseas. Ed.386, 1397–1408. 10.1056/nejmoa2118542
- CrossRef
- Google Scholar
22
HorstJ. A.LaurenziA.BernardB.SamudralaR. (2012). Computational multitarget drug discovery. Polypharmacology Drug Discov., 263–301. 10.1002/9781118098141.ch13
- CrossRef
- Google Scholar
23
HudsonM. L.SamudralaR. (2021). Multiscale virtual screening optimization for shotgun drug repurposing using the cando platform. Molecules26, 2581. 10.3390/molecules26092581
- CrossRef
- Google Scholar
24
JärvelinK.KekäläinenJ. (2002). Cumulated gain-based evaluation of ir techniques. ACM Trans. Inf. Syst.20, 422–446. 10.1145/582415.582418
- CrossRef
- Google Scholar
25
JenwitheesukE.HorstJ. A.RivasK. L.Van VoorhisW. C.SamudralaR. (2008). Novel paradigms for drug discovery: Computational multitarget screening. Trends Pharmacol. Sci.29, 62–71. 10.1016/j.tips.2007.11.007
- CrossRef
- Google Scholar
26
JenwitheesukE.SamudralaR. (2005). Identification of potential multitarget antimalarial drugs. JAMA294, 1490–1491. 10.1001/jama.294.12.1490
- CrossRef
- Google Scholar
27
JenwitheesukE.SamudralaR. (2003a). Identifying inhibitors of the sars coronavirus proteinase. Bioorg. Med. Chem. Lett.13, 3989–3992. 10.1016/j.bmcl.2003.08.066
- CrossRef
- Google Scholar
28
JenwitheesukE.SamudralaR. (2003b). Improved prediction of hiv-1 protease-inhibitor binding energies by molecular dynamics simulations. BMC Struct. Biol.3, 2. 10.1186/1472-6807-3-2
- CrossRef
- Google Scholar
29
LandrumG. (2013). Rdkit: A software suite for cheminformatics, computational chemistry, and predictive modeling.
- Google Scholar
30
LengL. D. (2020). Targeting covid-19: Ghddi info sharing portal.
- Google Scholar
31
LeverJ.AltmanR. B. (2021). Analyzing the vast coronavirus literature with CoronaCentral. Proc. Natl. Acad. Sci. U. S. A.118, e2100766118. 10.1073/pnas.2100766118
- CrossRef
- Google Scholar
32
LiuY.WuY.ShenX.XieL. (2021). Covid-19 multi-targeted drug repurposing using few-shot learning. Front. Bioinform.1, 18. 10.3389/fbinf.2021.693177
- CrossRef
- Google Scholar
33
MahaseE. (2021). Covid-19: Molnupiravir reduces risk of hospital admission or death by 50% in patients at risk, msd reports.
- Google Scholar
34
MangioneW. (2022). Comprehensive elucidation of small molecule therapeutic behavior using multitarget theory. Ann Arbor, Michigan: Ph.D. thesis, University. at Buffalo. Copyright - Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works; Last updated - 2022-03-21.
- Google Scholar
35
MangioneW.FallsZ. (2022). cando.py.
- Google Scholar
36
MangioneW.FallsZ.ChopraG.SamudralaR. (2020a). cando. py: Open source software for predictive bioanalytics of large scale drug–protein–disease data. J. Chem. Inf. Model.60, 4131–4136. 10.1021/acs.jcim.0c00110
- CrossRef
- Google Scholar
37
MangioneW.FallsZ.MelendyT.ChopraG.SamudralaR. (2020b). Shotgun drug repurposing biotechnology to tackle epidemics and pandemics. Drug Discov. Today25, 1126–1128. 10.1016/j.drudis.2020.05.002
- CrossRef
- Google Scholar
38
MangioneW.FallsZ.SamudralaR. (2022). Effective holistic characterization of small molecule effects using heterogeneous biological networks. bioRxiv.
- Google Scholar
39
MangioneW.SamudralaR. (2019). Identifying protein features responsible for improved drug repurposing accuracies using the cando platform: Implications for drug design. Molecules24, 167. 10.3390/molecules24010167
- CrossRef
- Google Scholar
40
MichaelS.IsernS.GarryR.CostinJ.JenwithesukE.SamudralaR. (2011a). Optimized dengue virus entry inhibitory peptide (dn81).
- Google Scholar
41
MichaelS.IsernS.GarryR.CostinJ.JenwithesukE.SamudralaR. (2011b). Optimized dengue virus entry inhibitory peptide(1oan1).
- Google Scholar
42
MinieM.ChopraG.SethiG.HorstJ.WhiteG.RoyA.et al (2014). Cando and the infinite drug discovery frontier. Drug Discov. Today19, 1353–1363. 10.1016/j.drudis.2014.06.018
- CrossRef
- Google Scholar
43
MohamedK.YazdanpanahN.SaghazadehA.RezaeiN. (2021). Computational drug discovery and repurposing for the treatment of Covid-19: A systematic review. Bioorg. Chem.106, 104490. 10.1016/j.bioorg.2020.104490
- CrossRef
- Google Scholar
44
MoukheiberL.MangioneW.MalekiS.FallsZ.GaoM.SamudralaR. (2021). Identifying protein features and pathways responsible for toxicity using machine learning, cando, and tox21 datasets: Implications for predictive toxicology. bioRxiv.
- Google Scholar
45
NicholsonC. O.CostinJ. M.RoweD. K.LinL.JenwitheesukE.SamudralaR.et al (2011). Viral entry inhibitors block dengue antibody-dependent enhancement in vitro. Antivir. Res.89, 71–74. 10.1016/j.antiviral.2010.11.008
- CrossRef
- Google Scholar
46
Organization (2022). Who coronavirus (covid-19) dashboard.
- Google Scholar
47
OskotskyT.MarićI.TangA.OskotskyB.WongR. J.AghaeepourN.et al (2021). Mortality risk among patients with Covid-19 prescribed selective serotonin reuptake inhibitor antidepressants. JAMA Netw. Open4, e2133090. 10.1001/jamanetworkopen.2021.33090
- CrossRef
- Google Scholar
48
OverhoffB.FallsZ.MangioneW.SamudralaR. (2021). A deep-learning proteomic-scale approach for drug design. Pharmaceuticals14, 1277. 10.3390/ph14121277
- CrossRef
- Google Scholar
49
OzturkT.TaloM.YildirimE. A.BalogluU. B.YildirimO.AcharyaU. R.et al (2020). Automated detection of Covid-19 cases using deep neural networks with x-ray images. Comput. Biol. Med.121, 103792. 10.1016/j.compbiomed.2020.103792
- CrossRef
- Google Scholar
50
PhamT.-H.QiuY.ZengJ.XieL.ZhangP. (2021). A deep learning framework for high-throughput mechanism-driven phenotype compound screening and its application to Covid-19 drug repurposing. Nat. Mach. Intell.3, 247–257. 10.1038/s42256-020-00285-9
- CrossRef
- Google Scholar
51
SamudralaR.FallsZ.MangioneW. (2020). Coronavirus treatment compositions and methods.
- Google Scholar
52
SchulerJ.FallsZ.MangioneW.HudsonM. L.BruggemannL.SamudralaR.et al (2021). Evaluating the performance of drug-repurposing technologies. Drug Discov. Today27, 49–64. 10.1016/j.drudis.2021.08.002
- CrossRef
- Google Scholar
53
SchulerJ.MangioneW.SamudralaR.CeustersW. (2019). Foundations for a realism-based drug repurposing ontology. In 10th Annual International Conference on Biomedical Ontology. 1–8.
- Google Scholar
54
SchulerJ.SamudralaR. (2019). Fingerprinting cando: Increased accuracy with structure-and ligand-based shotgun drug repurposing. ACS omega4, 17393–17403. 10.1021/acsomega.9b02160
- CrossRef
- Google Scholar
55
SethiG.ChopraG.SamudralaR. (2015). Multiscale modelling of relationships between protein classes and drug behavior across all diseases using the cando platform. Mini Rev. Med. Chem.15, 705–717. 10.2174/1389557515666150219145148
- CrossRef
- Google Scholar
56
ShenL.NiuJ.WangC.HuangB.WangW.ZhuN.et al (2019). High-throughput screening and identification of potent broad-spectrum inhibitors of coronaviruses. J. Virol.93, e0002319. 10.1128/JVI.00023-19
- CrossRef
- Google Scholar
57
SpinnerC. D.GottliebR. L.CrinerG. J.LópezJ. R. A.CattelanA. M.ViladomiuA. S.et al (2020). Effect of remdesivir vs standard care on clinical status at 11 days in patients with moderate Covid-19: A randomized clinical trial. Jama324, 1048–1057. 10.1001/jama.2020.16349
- CrossRef
- Google Scholar
58
VijayanV.PantP.VikramN.KaurP.SinghT.SharmaS.et al (2020). Identification of promising drug candidates against nsp16 of sars-cov-2 through computational drug repurposing study. J. Biomol. Struct. Dyn.39, 6713–6727. 10.1080/07391102.2020.1802349
- CrossRef
- Google Scholar
59
WangJ. (2020). Fast identification of possible drug treatment of coronavirus disease-19 (Covid-19) through computational drug repurposing study. J. Chem. Inf. Model.60, 3277–3286. 10.1021/acs.jcim.0c00179
- CrossRef
- Google Scholar
60
WangM.CaoR.ZhangL.YangX.LiuJ.XuM.et al (2020a). Remdesivir and chloroquine effectively inhibit the recently emerged novel coronavirus (2019-ncov) in vitro. Cell Res.30, 269–271. 10.1038/s41422-020-0282-0
- CrossRef
- Google Scholar
61
WangY.ZhangD.DuG.DuR.ZhaoJ.JinY.et al (2020b). Remdesivir in adults with severe Covid-19: A randomised, double-blind, placebo-controlled, multicentre trial. Lancet395, 1569–1578. 10.1016/S0140-6736(20)31022-9
- CrossRef
- Google Scholar
62
WishartD. S.FeunangY. D.GuoA. C.LoE. J.MarcuA.GrantJ. R.et al (2018). Drugbank 5.0: A major update to the drugbank database for 2018. Nucleic Acids Res.46, D1074–D1082. 10.1093/nar/gkx1037
- CrossRef
- Google Scholar
63
YangJ.RoyA.ZhangY. (2013). Protein–ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment. Bioinformatics29, 2588–2595. 10.1093/bioinformatics/btt447
- CrossRef
- Google Scholar
64
YangJ.YanR.RoyA.XuD.PoissonJ.ZhangY.et al (2015). The i-tasser suite: Protein structure and function prediction. Nat. Methods12, 7–8. 10.1038/nmeth.3213
- CrossRef
- Google Scholar
65
ZhouY.HouY.ShenJ.HuangY.MartinW.ChengF.et al (2020). Network-based drug repurposing for novel coronavirus 2019-ncov/sars-cov-2. Cell Discov.6, 14. 10.1038/s41421-020-0153-3
- CrossRef
- Google Scholar

Summary

Keywords

COVID-19, SARS-CoV-2, drug discovery, multitargeting, computational drug repurposing, computational biology

Citation

Mangione W, Falls Z and Samudrala R (2022) Optimal COVID-19 therapeutic candidate discovery using the CANDO platform. Front. Pharmacol. 13:970494. doi: 10.3389/fphar.2022.970494

Received

16 June 2022

Accepted

07 July 2022

Published

25 August 2022

Volume

13 - 2022

Edited by

Mithun Rudrapal, Rasiklal M. Dhariwal Institute of Pharmaceutical Education and Research, India

Reviewed by

Harun Patel, R. C. Patel Institute of Pharmaceutical Education and Research, India

Abdul Issahaku, University of KwaZulu-Natal, South Africa

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ram Samudrala, ram@compbio.org

This article was submitted to Drugs Outcomes Research and Policies, a section of the journal Frontiers in Pharmacology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Drugs Outcomes Research and Policies

ORIGINAL RESEARCH article

Optimal COVID-19 therapeutic candidate discovery using the CANDO platform

Abstract

1 Introduction

2 Results and discussion

2.1 Compound-protein interaction protocol parameter optimization

2.2 Generation and validation of drug candidates

3 Methods

3.1 Compound structure library and known actives curation

3.2 Protein structure library curation

3.3 Compound-protein interaction calculation

3.4 Parameter optimization using coronavirus active compound recovery

3.5 COVID-19 drug candidate generation

3.6 External validation studies curation

4 Conclusion

Statements

Data availability statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher’s note

Supplementary material

References

Summary

Outline

Figures

Cite article

Article metrics

ORIGINAL RESEARCH article

Optimal COVID-19 therapeutic candidate discovery using the CANDO platform

Abstract

1 Introduction

2 Results and discussion

2.1 Compound-protein interaction protocol parameter optimization

2.2 Generation and validation of drug candidates

3 Methods

3.1 Compound structure library and known actives curation

3.2 Protein structure library curation

3.3 Compound-protein interaction calculation

3.4 Parameter optimization using coronavirus active compound recovery

3.5 COVID-19 drug candidate generation

3.6 External validation studies curation

4 Conclusion

Statements

Data availability statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher’s note

Supplementary material

References

Summary

Outline

Figures

Cite article

Share article

Article metrics