REVIEW article

Front. Mol. Biosci., 07 May 2021

Sec. Biological Modeling and Simulation

Volume 8 - 2021 | https://doi.org/10.3389/fmolb.2021.663301

Structure-Guided Computational Approaches to Unravel Druggable Proteomic Landscape of Mycobacterium leprae

  • 1. Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom

  • 2. Rutherford Appleton Laboratory, Science and Technology Facilities Council, Oxon, United Kingdom

  • 3. Laboratório de Modelagem e Dinâmica Molecular, Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil

Abstract

Leprosy, caused by Mycobacterium leprae (M. leprae), is treated with a multidrug regimen comprising Dapsone, Rifampicin, and Clofazimine. These drugs exhibit bacteriostatic, bactericidal and anti-inflammatory properties, respectively, and control the dissemination of infection in the host. However, the current treatment is not cost-effective, does not favor patient compliance due to its long duration (12 months) and does not protect against the incumbent nerve damage, which is a severe leprosy complication. The chronic infectious peripheral neuropathy associated with the disease is primarily due to the bacterial components infiltrating the Schwann cells that protect neuronal axons, thereby inducing a demyelinating phenotype. There is a need to discover novel/repurposed drugs that can act as short duration and effective alternatives to the existing treatment regimens, preventing nerve damage and consequent disability associated with the disease. Mycobacterium leprae is an obligate pathogen resulting in experimental intractability to cultivate the bacillus in vitro and limiting drug discovery efforts to repositioning screens in mouse footpad models. The dearth of knowledge related to structural proteomics of M. leprae, coupled with emerging antimicrobial resistance to all the three drugs in the multidrug therapy, poses a need for concerted novel drug discovery efforts. A comprehensive understanding of the proteomic landscape of M. leprae is indispensable to unravel druggable targets that are essential for bacterial survival and predilection of human neuronal Schwann cells. Of the 1,614 protein-coding genes in the genome of M. leprae, only 17 protein structures are available in the Protein Data Bank. In this review, we discussed efforts made to model the proteome of M. leprae using a suite of software for protein modeling that has been developed in the Blundell laboratory. Precise template selection by employing sequence-structure homology recognition software, multi-template modeling of the monomeric models and accurate quality assessment are the hallmarks of the modeling process. Tools that map interfaces and enable building of homo-oligomers are discussed in the context of interface stability. Other software is described to determine the druggable proteome by using information related to the chokepoint analysis of the metabolic pathways, gene essentiality, homology to human proteins, functional sites, druggable pockets and fragment hotspot maps.

Introduction

Mycobacterium leprae causes leprosy in about 200,000 people each year globally. Leprosy is a dermato-neurological infectious disease with varied clinical manifestations, often resulting in peripheral sensorimotor/demyelinating neuropathy leading to permanent nerve damage and disability. The World Health Organization currently recommends a combinatorial therapy [multidrug therapy (MDT)] with Dapsone, Rifampicin (Rifampin) and Clofazimine to treat leprosy (Manglani and Arif, 2006). MDT has proven effective in reducing the prevalence and controlling the incidence from about 5 million new cases in the 1990s to ∼200,000 new cases from the year 2005 (after India declared the elimination of leprosy). However, with the emergence of single and multidrug-resistant strains of M. leprae, novel therapies are essential to curb ongoing transmission of the disease. Also, the current therapy duration with MDT is 1 year leading to reduced treatment compliance and increased defaulter rates globally (Cambau et al., 2018).

Mycobacterium leprae is phylogenetically the closest bacterial species to Mycobacterium tuberculosis (M. tuberculosis). However, the M. leprae has a reduced genome of 3.2 Mbp, compared to 4.4 Mbp in M. tuberculosis, and survive with only 1,614 protein coding genes which are largely annotated based the features of their homologues in M. tuberculosis and other mycobacterial species (Cole et al., 2001). Dapsone interacts with bacterial dihydropteroate synthase, an enzyme essential for folic acid biosynthesis in bacteria. It is absent in humans (Cambau et al., 2006). Rifampin interacts with RNA polymerase, an enzyme critical for DNA dependent RNA synthesis (transcription) in M. leprae. These drugs are either bacteriostatic or bactericidal. However, they do not interfere with predilection of M. leprae for human nerve cells, which is a significant cause for demyelinating neuropathy in leprosy (Lockwood and Saunderson, 2012). Newer antibacterial agents that can effectively treat the disease within a short duration of time and prevent nerve damage are essential to reduce morbidity associated with the disease (Rao and Jain, 2013). Currently known drugs for leprosy, their drug target proteins and references related to their mechanisms of action are listed in Table 1.

TABLE 1

DrugTarget proteins/Ribosomal subunitsGene (gene name)References
DapsoneDihydropteroate synthase (DHPS)folP1 (ML0224)Williams et al., 2000
Rifampinβ-subunit of the DNA-dependent RNA polymeraserpoB (ML1891)Lin et al., 2017
ClofazimineUnknown-Lechartier and Cole, 2015
FluoroquinolonesDNA gyrase subunit AgyrA (ML0006)Blower et al., 2016
DNA gyrase subunit BgyrB (ML0005)Yamaguchi et al., 2016
Macrolides50S subunit (23S rRNA in particular)-Ji et al., 1996
Minocycline30S ribosomal subunit, blocking the binding of aminoacyl-tRNA to the 16S rRNA-Ji et al., 1996
ThioamidesEnoyl-ACP-reductaseinhA (ML1806)Wang et al., 2007
BedaquilineProton pump of ATP synthaseatpE (ML1140)Guo et al., 2021
EpiroprimDihydrofolate reductasefolA (ML1518)Dhople, 2002

Drugs and their corresponding target proteins in M. leprae.

Knowledge of the structural components of the proteome of M. leprae is critical for identifying drug target proteins and deciphering their essential roles in the survival of the pathogen. Key enzymes that catalyze chokepoint reactions can act as potential drug targets for antimycobacterial discovery. However, information related to 3D structures of these proteins is scarce for M. leprae, necessitating a more focussed structural genomics effort to unravel the druggable proteomic landscape of this bacillus long known to humankind.

Software tools that predict stability and affinity changes in drug-target proteins due to substitution mutations are discussed in the context of antimicrobial resistance. While the emphasis is on deciphering the druggable proteome, we provide a brief overview of the structure-guided virtual screening tools and methods that facilitate the chemical expansion of fragment-like small molecules to lead-like or drug-like compounds in the active or allosteric sites of the target protein.

Proteome Modeling in Mycobacterium leprae and Its Relevance to Structure-Guided Drug Discovery

Of the 1,614 annotated genes that are expressed in M. leprae, the structures of only 17 proteins are available (see Table 2) to date in the publicly available databases [Protein Data Bank (PDB) (Berman et al., 2000)], as opposed to around 1,277 entries for Mycobacterium tuberculosis. Solving the crystal/cryoEM structures of all the potential drug targets in M. leprae requires costly and labor intensive effort. Given the high sequence identity of many of the M. leprae proteins with their homologous counterparts in M. tuberculosis with solved structures in the PDB, employing computational tools to perform comparative modeling of proteins in M. leprae can be a robust alternative for acquiring a preliminary understanding of the functional sites and small molecule interactions.

TABLE 2

Gene IdPDB IdDescriptionReferences
ML24414EO9Crystal structure of a phosphoglycerate mutase gpm1 from Mycobacterium lepraeBaugh et al., 2015
ML02104ECPX-ray crystal structure of Inorganic Pyrophosphate PPA from Mycobacterium lepraeUnpublished
ML05604J07Crystal structure of a PROBABLE RIBOFLAVIN SYNTHASE, BETA CHAIN RIBH (6,7-dimethyl-8-ribityllumazine synthase, DMRL synthase, Lumazine synthase) from Mycobacterium lepraeUnpublished
ML13825IE8The pyrazinoic acid binding domain of Ribosomal Protein S1 from Mycobacterium tuberculosis*Huang B. et al., 2016
ML04821BVSRUVA Complexed to a Holliday JunctionRoe et al., 1998
ML26843AFPCrystal structure of the single-stranded DNA binding protein from Mycobacterium leprae (Form I)”Kaushal et al., 2010
ML26402CKDCrystal structure of ML2640 from Mycobacterium lepraeGraña et al., 2007
ML03801LEPThree-Dimensional Structure of the Immunodominant Heat-Shock Protein Chaperonin-10 of Mycobacterium LepraeMande et al., 1996
ML19623I4OCrystal Structure of Translation Initiation Factor 1 from Mycobacterium tuberculosis*Hatzopoulos and Mueller-Dieckmann, 2010
ML2428A5O61The complete structure of the Mycobacterium smegmatis 70S ribosome*Hentschel et al., 2017
ML14854WKWCrystal Structure of a Conserved Hypothetical Protein from Mycobacterium leprae Determined by Iodide SAD PhasingUnpublished
ML21743R2NCrystal structure of cytidine deaminase from Mycobacterium lepraeBaugh et al., 2015
ML18062NTVMycobacterium leprae InhA bound with PTH-NAD adductWang et al., 2007
ML26843AFQCrystal structure of the single-stranded DNA binding protein from Mycobacterium leprae (Form II)Kaushal et al., 2010
ML20694EX4The Structure of GlcB from Mycobacterium lepraeUnpublished
ML26402UYOCrystal structure of ML2640c from Mycobacterium leprae in an hexagonal crystal formGraña et al., 2007
ML26402UYQCrystal structure of ML2640c from Mycobacterium leprae in complex with S-adenosylmethionineGraña et al., 2007

List of protein structures available for M. leprae in Protein Data Bank

*The solved region of the protein structure is 100% in sequence identity with M. leprae.

Different groups have made several attempts to model the proteins of M. leprae. Table 3 lists two web-resources where such information is available. Computational protein structure modeling can reduce the sequence-structure gaps and enable genome-scale modeling of infectious pathogens (Bienert et al., 2017). Although the paucity of structural proteomics information for M. leprae in the publicly available databases is a challenge, the software developed in the Blundell laboratory will be useful in performing proteome scale modeling pipeline (Vivace) for proteomes of Mycobacterial pathogens and other bacterial species (Skwark et al., 2019). Vivace optimizes template selection, enables sequence-structure alignments, and constructs optimal quality models in both apo- and ligand-bound states. To facilitate multi-template modeling, protein structures from the entire PDB are initially organized in a structural profile database named TOCCATA (Ochoa-Montaño et al., 2015). Protein structures within each profile are classified based on domain annotations in CATH (Sillitoe et al., 2019) and SCOP (Andreeva et al., 2020) databases, pre-aligned and functionally annotated with information derived from UniProt (The UniProt Consortium, 2021) and PDB. The query protein sequence is aligned with representative structures from each cluster using a sequence-structure alignment tool named FUGUE (Shi et al., 2001). FUGUE recognizes distant homologues by sequence-structure comparison using environment-specific substitution tables and structure-dependent gap penalties. Alignments generated by FUGUE are fed into Modeler 9.24 (Webb and Sali, 2016) for model building. The ligands and other small molecules are modeled into corresponding protein structure models at sites recognized from the ligand-bound templates. Multiple models are generated based on the number of cluster hits, ranging from 3 to ∼1,000 models per query sequence in the M. leprae proteome. These models are of different states (ligand-bound and apomeric) and of varying quality based on the templates used in each profile.

TABLE 3

Web resourceDescriptionAvailabilityReferences
ModBaseA database of annotated comparative protein structure models and associated resourceshttps://modbase.compbio.ucsf.edu/modbase-cgi/index.cgiPieper et al., 2011
SwissModel RepositoryA database of annotated 3D protein structure models generated by the SWISS-MODEL homology-modeling pipelinehttps://swissmodel.expasy.org/repositoryBienert et al., 2017

Web resources with models of M. leprae proteins (modelled using single templates).

Once modeled, each of the protein structure models undergoes a rigorous quality assessment by employing methods such as NDOPE, GA341 (Shen and Sali, 2006), GOAP (Zhou and Skolnick, 2011), SOAP (Webb and Sali, 2016), Molprobity (Chen et al., 2010) and secondary structure agreement score (Eramian et al., 2006). Models with extensive chain clashes, poorly resolved loops and improperly fitted ligands are ranked low in the consensus quality scoring process described in CHOPIN—a web resource for structural and functional proteome of Mycobacterium tuberculosis (Ochoa-Montaño et al., 2015).

Vivace is being used to model the proteome of M. leprae. Sequence and structural features at the genome-scale are being analyzed to identify essential enzymes that drive chokepoint metabolic reactions. Models in apomeric, ligand-bound and oligomeric (discussed in the later sections) states are being generated and analyzed for surface topology, cavities (Binkowski et al., 2003) and fragment hotspots (sites for potential small molecule binding) (Radoux et al., 2016). The schematic workflow shown in Figure 1 illustrates the modeling procedures adopted by our group to model proteomes of mycobacterial pathogens.

FIGURE 1

Approaches to Predict Homo/Hetero-Oligomeric Complexes

Protein-protein interactions (homo/hetero) govern a majority of the cellular processes. Structure determination of these complexes is crucial for understanding their functions. Usually, the experimental techniques used to unravel interacting protein partners are time consuming, challenging and expensive. There have been significant advances in the development of computational methods and tools to identify interacting pairs and predict the structures of protein-protein complexes (Das and Chakrabarti, 2021).

The computational tools for predicting protein-protein interactions developed over the years can be classified into the knowledge-based or de novo prediction methods. If the structures of the interacting partners are known, the interactions can be predicted using template-based, or template free and/or restraint-based docking. Template-based docking can provide the multi-component modeled complex but requires the presence of multi-component template structures (Ogmen et al., 2005; Mukherjee and Zhang, 2011). If the homologous multi-component template is unavailable, protein-protein docking approaches can be used to sample the conformational space and predict the docked complexes which are further scored using different schemes to discriminate native-like conformations from a pool of docked solutions. These different approaches for computational modeling of protein interactions were recently reviewed by Soni and Madhusudhan (2017).

Recently, tools have been developed which can make use of the wealth of sequence information available for protein sequences to predict/model interactions accurately. Machine learning approaches including deep learning have played a significant role in training models which can predict the interactions using the features derived from protein sequences alone (Huang Y.-A. et al., 2016; Du et al., 2017; Sun et al., 2017; Chen et al., 2019). The inspection of co-evolving sites in two protein partners can provide strong signals to elucidate interacting partners (Yu et al., 2016). A recent method CoFex (Hu and Chan, 2017) used co-evolutionary features to predict protein interactions. Ensemble based approaches which use multiple machine learning methods to vote for predictions have been reported to achieve high sensitivity and accuracy (Zhang et al., 2019; Li et al., 2020). Deep learning has also been employed to train a convolutional neural network (CNN) to predict the protein interacting pairs with high accuracy (Wang et al., 2019; Torrisi et al., 2020).

However, in-silico approaches can often give false positive or negative results as well, hence one also needs validation strategies to assess the quality of predicted interactions. Efforts in the community such as CASP (Critical Assessment of Structure Prediction) and CAPRI (Critical Assessment of Prediction of Interfaces) competitions, aim to assess the field and the state-of-the-art methods and their ability to “correctly” model protein structures and their interactions, respectively. They define and use multiple scores for assessing the quality of protein structure and interfaces in the modeled complexes. CASP14 is the present ongoing competition, where deep learning approach-AlphaFold2 has outperformed and were able to accurately predict the protein structures (AlQuraishi, 2019).

To illustrate the modeling process adopted by Vivace, Figure 2 depicts the models of three potential drug targets in M. leprae, the menB [1,4-dihydroxy-2-naphthoyl-CoA synthase (ML2263)], menD [2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase (ML2270)] and coaA [Pantothenate kinase (ML1954)].

FIGURE 2

The gene product of menB converts o-succinylbenzoyl-CoA (OSB-CoA) to 1,4-dihydroxy-2-naphthoyl-CoA (DHNA-CoA) and its homologue in M. tuberculosis (Rv0548c) is reported as essential (DeJesus et al., 2017). We built the model using the structure of its orthologous protein in M. tuberculosis (PDB Id: 4QII) as the template with sequence identity of 93% and sequence coverage of 100% (Figure 2A). The gene product of menD catalyzes the thiamine diphosphate-dependent decarboxylation of 2-oxoglutarate. Its homologue in M. tuberculosis (Rv0555) is noted to be essential for bacterial survival. We modeled menD of M. leprae using the structure of the M. tuberculosis orthlogue (PDB Id: 5ESD) as the template with the sequence identity of 86% and sequence coverage of 99% (Figure 2B). Finally, we modeled coaA which synthesizes coA from (R)—Pantothenate. CoaA has been recognized as a drug target in tuberculosis (Chiarelli et al., 2018). We modeled coaA using its orthologue in M. tuberculosis (PDB Id: 2GET) as the template with sequence identity of 93% and sequence coverage of 98% (Figure 2C).

Structural Implications of Substitution Mutations

Development of drug resistance is recognized as one of the major hurdles to disease management and control. For M. leprae, it is even more challenging as it relies on mouse footpad models (Vedithi et al., 2018). Antimicrobial resistance was noted in Dapsone, Rifampicin and Ofloxacin (a second-line drug). Treating and managing the disease is a big hurdle due to emerging drug resistance for these three drugs and lack of alternative effective treatments.

The drug-resistance mutations when mapped on to the three-dimensional structure of the drug target can provide crucial insights into effects on protein structure and function. There are several available software/web servers that can predict the impacts of mutation on protein stability and interactions with other proteins, ligands and nucleic acids. We have provided a list of some of the commonly used software tools for investigating the effects of mutations on protein structure and function (Table 4).

TABLE 4

SoftwareDescriptionAvailabilityReferences
SIFTAmino acid substitution effect on protein functionhttps://sift.bii.a-star.edu.sg/Ng and Henikoff, 2003
PolyPhen-2Amino acid substitution effect on protein structure and function using protein sequencehttp://genetics.bwh.harvard.edu/pph2/Adzhubei et al., 2010
SNPs3DAmino acid substitution effect on protein structure and function using SVM based modelhttp://snps3d.org/Yue et al., 2006
MutPred2Machine learning approach to quantify pathogenicity of mutationhttp://mutpred.mutdb.org/index.htmlPejaver et al., 2020
PROVEANImpact of mutation on protein function by using multiple sequence alignmenthttp://provean.jcvi.org/index.phpChoi and Chan, 2015
mCSMEffect of mutation on protein structure and interactions using graph-based signatureshttp://biosig.unimelb.edu.au/mcsm/Pires et al., 2014a
SDM2Effect of mutation on protein structure and interactions using environment-specific amino-acid substitution frequencieshttp://marid.bioc.cam.ac.uk/sdm2Pandurangan et al., 2017, 2
DUETConsensus prediction of mCSM and SDM2 for protein stabilityhttp://biosig.unimelb.edu.au/duet/Pires et al., 2014b
PoPMuSiC-2Effects of mutation on protein stability using statistical potentialshttp://dezyme.com/en/ServicesDehouck et al., 2009
FoldXChange in free energy using force fields-based methodhttp://foldxsuite.crg.eu/Schymkowitz et al., 2005
HunterPredicting protein stability upon mutation using side chain interactionshttp://bioinfo41.weizmann.ac.il/hunter/Potapov et al., 2010
MAESTROMeasures changes in free energy upon mutation using machine learninghttps://pbwww.che.sbg.ac.at/?page_id=416Laimer et al., 2015
I-Mutant3.0SVM based prediction of protein stability change upon mutation using either sequence and/or structurehttp://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant3.0/I-Mutant3.0.cgiCapriotti et al., 2008
MUProSVM and neural network-based prediction of changes in protein stabilityhttp://mupro.proteomics.ics.uci.edu/Cheng et al., 2006
iStableChange in free energy using SVM based predictorhttp://predictor.nchu.edu.tw/istable/Chen et al., 2013
MutaBindChange in free energy using force fields, statistical potentials and side-chain optimisation methodshttps://www.ncbi.nlm.nih.gov/research/mutabind/index.fcgi/Li et al., 2016
BeAtMuSiCImpact of mutations on protein-protein interactions using statistical potentialshttp://babylone.ulb.ac.be/beatmusic/Dehouck et al., 2013
SNAP2Predict functional impacts of mutations using neural network-based modelhttps://rostlab.org/services/snap2web/Hecht et al., 2015
EnvisionSupervised, stochastic gradient boosting algorithm to quantify the effect of mutationhttps://envision.gs.washington.edu/shiny/envision_new/Gray et al., 2018
EVmutationUnsupervised statistical method to predict effect of mutations using residue dependencies between positionshttps://marks.hms.harvard.edu/evmutation/

Some of the commonly used tools for predicting the effect of mutations on protein structure and function.

Our own group have developed over the past decade the mCSM suite of computer programmes that use ML/AI approaches to predict the impacts of amino acid mutations not only on protomer stability (Pires et al., 2014a) but also on protein-protein, protein nucleic acid and protein-ligand interactions (Pires et al., 2016; Pires and Ascher, 2017). Recently, there have been further developments in the field where machine learning (ML)-based methods are gaining popularity. Many more recent ML methods also use features derived from protein structure and/or sequence to predict the effect of mutations (Hopf et al., 2017). A recent review, summarizes the performance of different ML methods and emphasizes the need for selecting reliable training dataset and informative features (Fang, 2020). Deep learning is an advanced training which is composed of multiple layers to learn different features from the input data and is proven to learn from the high-dimensional data. Recently, a method called DeepCLIP (Grønning et al., 2020) has been proposed which can predict protein binding to RNA using only sequence data. Another recently developed deep learning framework-MuPIPR (Zhou et al., 2020) (Mutation Effects in Protein–protein Interaction Prediction Using Contextualized Representations), can predict the effects of mutation on protein-protein interactions in terms of changes in buried surface area and binding affinity.

In-silico Saturation Mutagenesis

Using the tools described above, computational efforts exploiting recent growth in the availability of computing power can be immensely helpful to perform saturation mutagenesis, which can be used as a surveillance tool for drug resistance in leprosy. These mutational scanning exercises can provide crucial insights into the structure-function relationships by exploring all possible substitutions at a given site. This can provide a glimpse into the functional consequences of mutations in antimicrobial-resistance phenotypes. The extensive quantitative data from computational saturation mutagenesis experiments can guide experimental approaches and prove helpful for validation and/or engineering purposes. Recently published HARP (a database of Hansen’s Disease Antimicrobial Resistance Profiles) database (Vedithi et al., 2020) is a comprehensive repository of in-silico mutagenesis experiments for three important drug targets for M. leprae namely dihydropteroate synthase, RNA polymerase and DNA gyrase. A consensus impact for all the possible mutations on protein stability and function of these drug targets is provided in this database.

Druggability

Mycobacterium leprae genome is reduced to 3,268,203 bp preserving only 1,614 ORFs (Cole et al., 2001; Liu et al., 2004) of the Mycobacterial genus. The genome reduction is due to evolutionary adaptation of this intracellular obligate bacillus to Schwann and macrophages cells. Gene essentiality in M. leprae is deciphered based on essentiality of homologous genes, mainly in M. tuberculosis that are determined by experiments (Sassetti et al., 2003; DeJesus et al., 2017). Because of the evolutionary loss of non-essential genes by pseudogenization, only 65% of the existing total of M. leprae genes have been demonstrated to be essentials (Borah et al., 2020). In order to identify potential drug targets, a chokepoint reaction analysis helps to find proteins that are either consumers of unique substrates, or are unique producers of metabolites. It is predicted that the inhibition of chokepoint proteins produces an interruption of essential cell functions (Yeh et al., 2004). Determining the druggability of protein targets is important to avoid intractable targets. A druggable protein has the ability to bind with high affinity to a drug. In leprosy, the dihydropteroate synthase (DHPS), RNA polymerase (RNAP) and DNA gyrase (GYR) are known druggable proteins as they are the targets of Dapsone, Rifampicin and Ofloxacin, respectively. Nevertheless, protein druggability properties can be predicted by different bioinformatics tools based on the 3D structure/model of the protein. For example, the α-1,2-mannosyltransferase and mannosyltransferase proteins related to lipoarabinomannan pathway were identified as a possible drug targets using CASTp (Computer Altas of Surface Topography of proteins) (Gupta et al., 2020). CASTp determines the volume and the area of each cavity and pocket. Furthermore, for each pocket the solvent accessible surface and the molecular surface are calculated. Small-molecule virtual screening is another in-silico strategy used to ascertain druggability of the protein target. This approach provides an understanding of the physicochemical features of the binding sites and potential ligands that bind at these sites. In Mycobacterium tuberculosis, 2,809 proteins are identified as druggable using this in-silico approach (Anand and Chandra, 2014). Mammalian cell entry proteins of the class mce1A-E have been reported in M. leprae to facilitate bacterial entry into human nasal epithelial cells (Fadlitha et al., 2019). Mce1A has a significant role in the cell predilection and a comprehensive understanding of the structure and druggability of this protein can provide insights into host pathogen interactions and transmission in leprosy (Sato et al., 2007). In the case of ML2177c, this gene encodes for uridine phosphorylase and shows significantly high expression during leprosy infection. This is predicted as druggable using fragment-hotspot-map analysis (Malhotra et al., 2017). The fragment hotspots contain a juxtaposition of charge and lipophilicity that are essential for effective ligand binding through both enthalpic and entropic contributions. The hotspot map software uses different molecular probes (toluene, aniline and phenol) to calculate affinity maps that provide a visual guide of the pocket (Radoux et al., 2016). Figure 3 illustrates the recommended pathway to target prioritization in mycobacterial drug discovery.

FIGURE 3

Structure-Guided Virtual Screening

Structure-guided virtual screening is a cost-effective computational tool for preliminary screening of proteins that are potential drug targets with chemical libraries ranging from small core fragments to large macrocyclic compounds in size and scaling from a few hundred molecules to billions (used in ultra-large-scale virtual screening campaigns). Since physical synthesis of drug molecules is not required, millions of virtual chemical entities can be swiftly docked into the active site of the protein structure/model and appropriate chemical scaffolds that fit with high scores and form relevant stabilizing interactions can be shortlisted for experimental validations. Virtual screening can be applied to novel drug discovery and also in drug repositioning experiments (screening with existing approved drugs to identify new target-protein interactions). A repurposing screen of LipU, a lipolytic protein that is conserved across mycobacterial species and noted to be essential for survival of M. leprae, revealed high docking scores for anti-viral drugs and anti-hypertensive (Kaur et al., 2019). Molecular docking software, such as Glide (Friesner et al., 2006), CCDC-GOLD (Jones et al., 1997), Autodock (Goodsell and Olson, 1990), Ledock (Wang et al., 2016), FlexX (Kramer et al., 1999), and SwissDock (Grosdidier et al., 2011) are used in virtual screening campaigns. Each algorithm has a unique scoring function to assess the fitness, number of stable interatomic interactions, and changes in energy landscape.

Discussion and Conclusion

Here, we have reviewed the tools and the advances made in protein structure prediction, modeling of genomes and impacts of amino acid replacements on protein structure and function. We have discussed these areas particularly focusing on the mycobacterial genomes, more specifically M. leprae. Given the paucity of information related to structural proteomic studies in leprosy, we discussed a multi-task protein modeling pipeline that enables proteome-scale template-based modeling of individual proteins encoded by various annotated genes in M. leprae. Homology-based structural and functional annotation of these protein models (Ochoa-Montaño et al., 2015; Skwark et al., 2019) using appropriate computational tools for modeling and druggability assessment can expedite characterization of the structural proteome of M. leprae and accelerate structure-guided novel drug discovery to combat nerve damage associated with leprosy.

Using the latest advancements in the field of protein structure bioinformatics, we describe our attempts to perform proteome scale modeling of mycobacterial genomes using in-house databases and pipelines. The modeled protein monomers or (homo/hetero) oligomers are subjected to multiple state-of-the-art validation scores. These models can be very helpful and provide useful insights to understand protein structure and function, identify drug targets and unravel their functional roles in the pathogen. The structures of selected drug targets can also help in experimental design and prioritizing the protein targets for validation.

The emergence of drug resistance to the multidrug therapy is a major challenge in treating mycobacterial infections especially leprosy where structural features of drug-target interactions are poorly understood. To complement the computational findings, our group has employed cryoEM methods to understand the impact of mutations on the structure of catalase peroxidase in M. tuberculosis (Munir et al., 2019, 2021). Protein sequences and structures can be used to model the impacts of drug resistance mutation on protein structure and function. We have described various software available for predicting the impacts of mutations using protein sequence or structure or both. In-silico saturation mutagenesis experiments can guide the experimental design and help in saving the time and labor required to conduct laboratory experiments on animal models. Structure-based drug design (Blundell et al., 2002; Blundell and Patel, 2004) is a way forward and is a promising approach to design new drugs and treatments.

Statements

Author contributions

SV, SM, and MADGE conducted the review and written the manuscript. MM, PT, and TB reviewed the manuscript and provided necessary additions to the text. All authors contributed to the article and approved the submitted version.

Funding

SV, MADGE, and MM were supported by American Leprosy Missions Grant, United States of America (Grant No: G88726), SM was supported by the MRC DBT Grant, United Kingdom and India (RG78439), and TB was supported by the Wellcome Trust Programme Grant, United Kingdom (200814/Z/16/Z).

Acknowledgments

Authors would like to thank the rest of the computational biology team at the Department of Biochemistry, University of Cambridge, United Kingdom, for their overarching support and guidance in the review process.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  • 1

    AdzhubeiI. A.SchmidtS.PeshkinL.RamenskyV. E.GerasimovaA.BorkP.et al (2010). A method and server for predicting damaging missense mutations.Nat. Methods7248249. 10.1038/nmeth0410-248

  • 2

    AlQuraishiM. (2019). AlphaFold at CASP13.Bioinformatics3548624865. 10.1093/bioinformatics/btz422

  • 3

    AnandP.ChandraN. (2014). Characterizing the pocketome of Mycobacterium tuberculosis and application in rationalizing polypharmacological target selection.Sci. Rep.46356. 10.1038/srep06356

  • 4

    AndreevaA.KuleshaE.GoughJ.MurzinA. G. (2020). The SCOP database in 2020: expanded classification of representative family and superfamily domains of known protein structures.Nucleic Acids Res.48D376D382. 10.1093/nar/gkz1064

  • 5

    BaughL.PhanI.BegleyD. W.CliftonM. C.ArmourB.DranowD. M.et al (2015). Increasing the structural coverage of tuberculosis drug targets.Tuberculosis (Edinb)95142148. 10.1016/j.tube.2014.12.003

  • 6

    BermanH. M.WestbrookJ.FengZ.GillilandG.BhatT. N.WeissigH.et al (2000). The protein data bank.Nucleic Acids Res.28235242.

  • 7

    BienertS.WaterhouseA.de BeerT. A. P.TaurielloG.StuderG.BordoliL.et al (2017). The SWISS-MODEL repository—new features and functionality.Nucleic Acids Res.45D313D319. 10.1093/nar/gkw1132

  • 8

    BinkowskiT. A.NaghibzadehS.LiangJ. (2003). CASTp: computed Atlas of surface topography of proteins.Nucleic Acids Res.3133523355. 10.1093/nar/gkg512

  • 9

    BlowerT. R.WilliamsonB. H.KernsR. J.BergerJ. M. (2016). Crystal structure and stability of gyrase–fluoroquinolone cleaved complexes from Mycobacterium tuberculosis.Proc. Natl. Acad. Sci. U.S.A.11317061713. 10.1073/pnas.1525047113

  • 10

    BlundellT. L.JhotiH.AbellC. (2002). High-throughput crystallography for lead discovery in drug design.Nat. Rev. Drug Discov.14554. 10.1038/nrd706

  • 11

    BlundellT. L.PatelS. (2004). High-throughput X-ray crystallography for drug discovery.Curr. Opin. Pharmacol.4490496. 10.1016/j.coph.2004.04.007

  • 12

    BorahK.KearneyJ.-L.BanerjeeR.VatsP.WuH.DahaleS.et al (2020). GSMN-ML- a genome scale metabolic network reconstruction of the obligate human pathogen Mycobacterium leprae.PLoS Negl. Trop. Dis.14:e0007871. 10.1371/journal.pntd.0007871

  • 13

    CambauE.CarthagenaL.ChauffourA.JiB.JarlierV. (2006). Dihydropteroate synthase mutations in the Folp1 gene predict dapsone resistance in relapsed cases of leprosy.Clin. Infect. Dis.42238241. 10.1086/498506

  • 14

    CambauE.SaundersonP.MatsuokaM.ColeS.KaiM.SuffysP.et al (2018). Antimicrobial resistance in leprosy: results of the first prospective open survey conducted by a WHO surveillance network for the period 2009–15.Clin. Microbiol. Infect.2413051310.

  • 15

    CapriottiE.FariselliP.RossiI.CasadioR. (2008). A three-state prediction of single point mutations on protein stability changes.BMC Bioinformatics9:S6. 10.1186/1471-2105-9-S2-S6

  • 16

    ChenC.-W.LinJ.ChuY.-W. (2013). iStable: off-the-shelf predictor integration for predicting protein stability changes.BMC Bioinformatics14:S5. 10.1186/1471-2105-14-S2-S5

  • 17

    ChenK.-H.WangT.-F.HuY.-J. (2019). Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme.BMC Bioinformatics20:308. 10.1186/s12859-019-2907-1

  • 18

    ChenV. B.ArendallW. B.HeaddJ. J.KeedyD. A.ImmorminoR. M.KapralG. J.et al (2010). MolProbity: all-atom structure validation for macromolecular crystallography.Acta Crystallogr. D Biol. Crystallogr.661221. 10.1107/S0907444909042073

  • 19

    ChengJ.RandallA.BaldiP. (2006). Prediction of protein stability changes for single-site mutations using support vector machines.Proteins6211251132. 10.1002/prot.20810

  • 20

    ChiarelliL. R.MoriG.OrenaB. S.EspositoM.LaneT.de Jesus Lopes RibeiroA. L.et al (2018). A multitarget approach to drug discovery inhibiting Mycobacterium tuberculosis PyrG and PanK.Sci. Rep.8:3187. 10.1038/s41598-018-21614-4

  • 21

    ChoiY.ChanA. P. (2015). PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels.Bioinformatics3127452747. 10.1093/bioinformatics/btv195

  • 22

    ColeS. T.EiglmeierK.ParkhillJ.JamesK. D.ThomsonN. R.WheelerP. R.et al (2001). Massive gene decay in the leprosy bacillus.Nature40910071011. 10.1038/35059006

  • 23

    DasS.ChakrabartiS. (2021). Classification and prediction of protein–protein interaction interface using machine learning algorithm.Sci. Rep.11:1761. 10.1038/s41598-020-80900-2

  • 24

    DehouckY.GrosfilsA.FolchB.GilisD.BogaertsP.RoomanM. (2009). Fast and accurate predictions of protein stability changes upon mutations using statistical potentials and neural networks: PoPMuSiC-2.0.Bioinformatics2525372543. 10.1093/bioinformatics/btp445

  • 25

    DehouckY.KwasigrochJ. M.RoomanM.GilisD. (2013). BeAtMuSiC: prediction of changes in protein–protein binding affinity on mutations.Nucleic Acids Res.41W333W339. 10.1093/nar/gkt450

  • 26

    DeJesusM. A.GerrickE. R.XuW.ParkS. W.LongJ. E.BoutteC. C.et al (2017). Comprehensive essentiality analysis of the Mycobacterium tuberculosis genome via saturating transposon mutagenesis.mBio8:e0213316. 10.1128/mBio.02133-16

  • 27

    DhopleA. M. (2002). In vivo activity of epiroprim, a dihydrofolate reductase inhibitor, singly and in combination with dapsone, against Mycobacterium leprae.Int. J. Antimicrob. Agents197174. 10.1016/s0924-8579(01)00470-8

  • 28

    DuX.SunS.HuC.YaoY.YanY.ZhangY. (2017). DeepPPI: boosting prediction of protein–protein interactions with deep neural networks.J. Chem. Inf. Model.5714991510. 10.1021/acs.jcim.7b00028

  • 29

    EramianD.ShenM.DevosD.MeloF.SaliA.Marti-RenomM. A. (2006). A composite score for predicting errors in protein structure models.Protein Sci.1516531666. 10.1110/ps.062095806

  • 30

    FadlithaV. B.YamamotoF.IdrisI.DahlanH.SatoN.AftitahV. B.et al (2019). The unique tropism of Mycobacterium leprae to the nasal epithelial cells can be explained by the mammalian cell entry protein 1A.PLoS Negl. Trop. Dis.13:e0006704. 10.1371/journal.pntd.0006704

  • 31

    FangJ. (2020). A critical review of five machine learning-based algorithms for predicting protein stability changes upon mutation.Brief Bioinform2112851292. 10.1093/bib/bbz071

  • 32

    FriesnerR. A.MurphyR. B.RepaskyM. P.FryeL. L.GreenwoodJ. R.HalgrenT. A.et al (2006). Extra precision glide: docking and scoring incorporating a model of hydrophobic enclosure for protein-ligand complexes.J. Med. Chem.4961776196. 10.1021/jm051256o

  • 33

    GoodsellD. S.OlsonA. J. (1990). Automated docking of substrates to proteins by simulated annealing.Proteins8195202. 10.1002/prot.340080302

  • 34

    GrañaM.HaouzA.BuschiazzoA.MirasI.WehenkelA.BondetV.et al (2007). The crystal structure of M. leprae ML2640c defines a large family of putative S-adenosylmethionine-dependent methyltransferases in mycobacteria.Protein Sci.1618961904. 10.1110/ps.072982707

  • 35

    GrayV. E.HauseR. J.LuebeckJ.ShendureJ.FowlerD. M. (2018). Quantitative missense variant effect prediction using large-scale mutagenesis data.Cell Syst.6116124.e3. 10.1016/j.cels.2017.11.003

  • 36

    GrønningA. G. B.DoktorT. K.LarsenS. J.PetersenU. S. S.HolmL. L.BruunG. H.et al (2020). DeepCLIP: predicting the effect of mutations on protein–RNA binding with deep learning.Nucleic Acids Res.4870997118. 10.1093/nar/gkaa530

  • 37

    GrosdidierA.ZoeteV.MichielinO. (2011). SwissDock, a protein-small molecule docking web service based on EADock DSS.Nucleic Acids Res.39W270W277. 10.1093/nar/gkr366

  • 38

    GuoH.CourbonG. M.BuelerS. A.MaiJ.LiuJ.RubinsteinJ. L. (2021). Structure of mycobacterial ATP synthase bound to the tuberculosis drug bedaquiline.Nature589143147. 10.1038/s41586-020-3004-3

  • 39

    GuptaE.GuptaS. R. R.NirajR. R. K. (2020). Identification of drug and vaccine target in Mycobacterium leprae: a reverse vaccinology approach.Int. J. Pept. Res. Ther.2613131326. 10.1007/s10989-019-09936-x

  • 40

    HatzopoulosG. N.Mueller-DieckmannJ. (2010). Structure of translation initiation factor 1 from Mycobacterium tuberculosis and inferred binding to the 30S ribosomal subunit.FEBS Lett.58410111015. 10.1016/j.febslet.2010.01.051

  • 41

    HechtM.BrombergY.RostB. (2015). Better prediction of functional effects for sequence variants.BMC Genomics16:S1. 10.1186/1471-2164-16-S8-S1

  • 42

    HentschelJ.BurnsideC.MignotI.LeibundgutM.BoehringerD.BanN. (2017). The complete structure of the Mycobacterium smegmatis 70S ribosome.Cell Rep.20149160. 10.1016/j.celrep.2017.06.029

  • 43

    HopfT. A.IngrahamJ. B.PoelwijkF. J.SchärfeC. P. I.SpringerM.SanderC.et al (2017). Mutation effects predicted from sequence co-variation. Nat. Biotechnol.35128135. 10.1038/nbt.3769

  • 44

    HuL.ChanK. C. C. (2017). Extracting coevolutionary features from protein sequences for predicting protein-protein interactions.IEEE ACM Trans. Comput. Biol. Bioinform.14155166. 10.1109/TCBB.2016.2520923

  • 45

    HuangB.FuJ.GuoC.WuX.LinD.LiaoX. (2016). (1)H, (15)N, (13)C resonance assignments for pyrazinoic acid binding domain of ribosomal protein S1 from Mycobacterium tuberculosis.Biomol. NMR Assign.10321324. 10.1007/s12104-016-9692-9

  • 46

    HuangY.-A.YouZ.-H.ChenX.ChanK.LuoX. (2016). Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding.BMC Bioinformatics17:184. 10.1186/s12859-016-1035-4

  • 47

    JiB.JametP.PeraniE. G.SowS.LienhardtC.PetinonC.et al (1996). Bactericidal activity of single dose of clarithromycin plus minocycline, with or without ofloxacin, against Mycobacterium leprae in patients.Antimicrob. Agents Chemother.4021372141. 10.1128/aac.40.9.2137

  • 48

    JonesG.WillettP.GlenR. C.LeachA. R.TaylorR. (1997). Development and validation of a genetic algorithm for flexible docking11Edited by F. E. Cohen.J. Mol. Biol.267727748. 10.1006/jmbi.1996.0897

  • 49

    KaurG.PandeyB.KumarA.GarewalN.GroverA.KaurJ. (2019). Drug targeted virtual screening and molecular dynamics of LipU protein of Mycobacterium tuberculosis and Mycobacterium leprae.J. Biomol. Struct. Dyn.3712541269. 10.1080/07391102.2018.1454852

  • 50

    KaushalP. S.SinghP.SharmaA.MuniyappaK.VijayanM. (2010). X-ray and molecular-dynamics studies on Mycobacterium leprae single-stranded DNA-binding protein and comparison with other eubacterial SSB structures.Acta Crystallogr. D Biol. Crystallogr.6610481058. 10.1107/S0907444910032208

  • 51

    KramerB.RareyM.LengauerT. (1999). Evaluation of the FLEXX incremental construction algorithm for protein–ligand docking.Proteins37228241. 10.1002/(sici)1097-0134(19991101)37:2<228::aid-prot8>3.0.co;2-8

  • 52

    LaimerJ.HoferH.FritzM.WegenkittlS.LacknerP. (2015). MAESTRO – multi agent stability prediction upon point mutations.BMC Bioinformatics16:116. 10.1186/s12859-015-0548-6

  • 53

    LechartierB.ColeS. T. (2015). Mode of Action of Clofazimine and Combination Therapy with Benzothiazinones against Mycobacterium tuberculosis.Antimicrob. Agents Chemother.5944574463. 10.1128/AAC.00395-15

  • 54

    LiM.SimonettiF. L.GoncearencoA.PanchenkoA. R. (2016). MutaBind estimates and interprets the effects of sequence variants on protein–protein interactions.Nucleic Acids Res.44W494W501. 10.1093/nar/gkw374

  • 55

    LiY.GoldingG. B.IlieL. (2020). DELPHI: accurate deep ensemble model for protein interaction sites prediction. Bioinformatics btaa750. 10.1093/bioinformatics/btaa750[Epub ahead of print].

  • 56

    LinW.MandalS.DegenD.LiuY.EbrightY. W.LiS.et al (2017). Structural basis of Mycobacterium tuberculosis transcription and transcription inhibition.Mol. Cell66169179.e8. 10.1016/j.molcel.2017.03.001

  • 57

    LiuY.HarrisonP. M.KuninV.GersteinM. (2004). Comprehensive analysis of pseudogenes in prokaryotes: widespread gene decay and failure of putative horizontally transferred genes.Genome Biol.5:R64. 10.1186/gb-2004-5-9-r64

  • 58

    LockwoodD. N.SaundersonP. R. (2012). Nerve damage in leprosy: a continuing challenge to scientists, clinicians and service providers.Int. Health47785. 10.1016/j.inhe.2011.09.006

  • 59

    MalhotraS.VedithiS. C.BlundellT. L. (2017). Decoding the similarities and differences among mycobacterial species.PLoS Negl. Trop. Dis.11:e0005883. 10.1371/journal.pntd.0005883

  • 60

    MandeS. C.MehraV.BloomB. R.HolW. G. (1996). Structure of the heat shock protein chaperonin-10 of Mycobacterium leprae.Science271203207. 10.1126/science.271.5246.203

  • 61

    ManglaniP. R.ArifM. A. (2006). Multidrug therapy in leprosy.J. Indian Med. Assoc.104686688.

  • 62

    MukherjeeS.ZhangY. (2011). Protein-protein complex structure predictions by multimeric threading and template recombination.Structure19955966. 10.1016/j.str.2011.04.006

  • 63

    MunirA.KumarN.RamalingamS. B.TamilzhalaganS.ShanmugamS. K.PalaniappanA. N.et al (2019). Identification and characterization of genetic determinants of isoniazid and rifampicin resistance in Mycobacterium tuberculosis in Southern India.Sci. Rep.9:10283. 10.1038/s41598-019-46756-x

  • 64

    MunirA.WilsonM. T.HardwickS. W.ChirgadzeD. Y.WorrallJ. A. R.BlundellT. L.et al (2021). Using cryo-EM to understand antimycobacterial resistance in the catalase-peroxidase (KatG) from Mycobacterium tuberculosis. Structure10.1016/j.str.2020.12.008[Epub ahead of print].

  • 65

    NgP. C.HenikoffS. (2003). SIFT: predicting amino acid changes that affect protein function.Nucleic Acids Res.3138123814. 10.1093/nar/gkg509

  • 66

    Ochoa-MontañoB.MohanN.BlundellT. L. (2015). CHOPIN: a web resource for the structural and functional proteome of Mycobacterium tuberculosis.Database (Oxford)2015:bav026. 10.1093/database/bav026

  • 67

    OgmenU.KeskinO.AytunaA. S.NussinovR.GursoyA. (2005). PRISM: protein interactions by structural matching.Nucleic Acids Res.33W331W336. 10.1093/nar/gki585

  • 68

    PanduranganA. P.Ochoa-MontañoB.AscherD. B.BlundellT. L. (2017). SDM: a server for predicting effects of mutations on protein stability.Nucleic Acids Res.45W229W235. 10.1093/nar/gkx439

  • 69

    PejaverV.UrrestiJ.Lugo-MartinezJ.PagelK. A.LinG. N.NamH.-J.et al (2020). Inferring the molecular and phenotypic impact of amino acid variants with MutPred2.Nat. Commun.11113. 10.1038/s41467-020-19669-x

  • 70

    PieperU.WebbB. M.BarkanD. T.Schneidman-DuhovnyD.SchlessingerA.BrabergH.et al (2011). ModBase, a database of annotated comparative protein structure models, and associated resources.Nucleic Acids Res.39D465D474. 10.1093/nar/gkq1091

  • 71

    PiresD. E. V.AscherD. B. (2017). mCSM–NA: predicting the effects of mutations on protein–nucleic acids interactions.Nucleic Acids Res.45W241W246. 10.1093/nar/gkx236

  • 72

    PiresD. E. V.AscherD. B.BlundellT. L. (2014a). DUET: a server for predicting effects of mutations on protein stability using an integrated computational approach.Nucleic Acids Res.42W314W319. 10.1093/nar/gku411

  • 73

    PiresD. E. V.AscherD. B.BlundellT. L. (2014b). mCSM: predicting the effects of mutations in proteins using graph-based signatures.Bioinformatics30335342. 10.1093/bioinformatics/btt691

  • 74

    PiresD. E. V.BlundellT. L.AscherD. B. (2016). mCSM-lig: quantifying the effects of mutations on protein-small molecule affinity in genetic disease and emergence of drug resistance.Sci. Rep.6:29575. 10.1038/srep29575

  • 75

    PotapovV.CohenM.InbarY.SchreiberG. (2010). Protein structure modelling and evaluation based on a 4-distance description of side-chain interactions.BMC Bioinformatics11:374. 10.1186/1471-2105-11-374

  • 76

    RadouxC. J.OlssonT. S. G.PittW. R.GroomC. R.BlundellT. L. (2016). Identifying interactions that determine fragment binding at protein hotspots.J. Med. Chem.5943144325. 10.1021/acs.jmedchem.5b01980

  • 77

    RaoP. N.JainS. (2013). Newer Management Options in Leprosy.Indian J. Dermatol.58611. 10.4103/0019-5154.105274

  • 78

    RoeS. M.BarlowT.BrownT.OramM.KeeleyA.TsanevaI. R.et al (1998). Crystal structure of an octameric RuvA-holliday junction complex.Mol. Cell2361372. 10.1016/s1097-2765(00)80280-4

  • 79

    SassettiC. M.BoydD. H.RubinE. J. (2003). Genes required for mycobacterial growth defined by high density mutagenesis.Mol. Microbiol.487784. 10.1046/j.1365-2958.2003.03425.x

  • 80

    SatoN.FujimuraT.MasuzawaM.YogiY.MatsuokaM.KanohM.et al (2007). Recombinant Mycobacterium leprae protein associated with entry into mammalian cells of respiratory and skin components.J. Dermatol. Sci.46101110. 10.1016/j.jdermsci.2007.01.006

  • 81

    SchymkowitzJ.BorgJ.StricherF.NysR.RousseauF.SerranoL. (2005). The FoldX web server: an online force field.Nucleic Acids Res.33W382W388. 10.1093/nar/gki387

  • 82

    ShenM.SaliA. (2006). Statistical potential for assessment and prediction of protein structures.Protein Sci.1525072524. 10.1110/ps.062416606

  • 83

    ShiJ.BlundellT. L.MizuguchiK. (2001). FUGUE: sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties.J. Mol. Biol.310243257. 10.1006/jmbi.2001.4762

  • 84

    SillitoeI.DawsonN.LewisT. E.DasS.LeesJ. G.AshfordP.et al (2019). CATH: expanding the horizons of structure-based functional annotations for genome sequences.Nucleic Acids Res.47D280D284. 10.1093/nar/gky1097

  • 85

    SkwarkM. J.TorresP. H. M.CopoiuL.BannermanB.FlotoR. A.BlundellT. L. (2019). Mabellini: a genome-wide database for understanding the structural proteome and evaluating prospective antimicrobial targets of the emerging pathogen Mycobacterium abscessus.Database2019baz113. 10.1093/database/baz113

  • 86

    SoniN.MadhusudhanM. S. (2017). Computational modeling of protein assemblies.Curr. Opin. Struct. Biol.44179189. 10.1016/j.sbi.2017.04.006

  • 87

    SunT.ZhouB.LaiL.PeiJ. (2017). Sequence-based prediction of protein protein interaction using a deep-learning algorithm.BMC Bioinformatics18:277. 10.1186/s12859-017-1700-2

  • 88

    The UniProt Consortium (2021). UniProt: the universal protein knowledgebase in 2021.Nucleic Acids Res.49D480D489. 10.1093/nar/gkaa1100

  • 89

    TorrisiM.PollastriG.LeQ. (2020). Deep learning methods in protein structure prediction.Comput. Struct. Biotechnol. J.1813011310. 10.1016/j.csbj.2019.12.011

  • 90

    VedithiS. C.MalhotraS.DasM.DanielS.KishoreN.GeorgeA.et al (2018). Structural implications of mutations conferring rifampin resistance in Mycobacterium leprae.Sci. Rep.8:5016. 10.1038/s41598-018-23423-1

  • 91

    VedithiS. C.MalhotraS.SkwarkM. J.MunirA.Acebrón-García-De-EulateM.WamanV. P.et al (2020). HARP: a database of structural impacts of systematic missense mutations in drug targets of Mycobacterium leprae.Comput. Struct. Biotechnol. J.1836923704. 10.1016/j.csbj.2020.11.013

  • 92

    WangF.LangleyR.GultenG.DoverL. G.BesraG. S.JacobsW. R.et al (2007). Mechanism of thioamide drug action against tuberculosis and leprosy.J. Exp. Med.2047378. 10.1084/jem.20062100

  • 93

    WangL.WangH.-F.LiuS.-R.YanX.SongK.-J. (2019). Predicting protein-protein interactions from matrix-based protein sequence using convolution neural network and feature-selective rotation forest.Sci. Rep.9112. 10.1038/s41598-019-46369-4

  • 94

    WangZ.SunH.YaoX.LiD.XuL.LiY.et al (2016). Comprehensive evaluation of ten docking programs on a diverse set of protein–ligand complexes: the prediction accuracy of sampling power and scoring power.Phys. Chem. Chem. Phys.181296412975. 10.1039/C6CP01555G

  • 95

    WebbB.SaliA. (2016). Comparative Protein structure modeling using MODELLER.Curr. Protoc. Bioinformatics545.6.15.6.37. 10.1002/cpbi.3

  • 96

    WilliamsD. L.SpringL.HarrisE.RocheP.GillisT. P. (2000). Dihydropteroate synthase of Mycobacterium leprae and dapsone resistance.Antimicrob. Agents Chemother.4415301537. 10.1128/aac.44.6.1530-1537.2000

  • 97

    YamaguchiT.YokoyamaK.NakajimaC.SuzukiY. (2016). DC-159a shows inhibitory activity against DNA gyrases of Mycobacterium leprae.PLoS Negl. Trop. Dis.10:e0005013. 10.1371/journal.pntd.0005013

  • 98

    YehI.HanekampT.TsokaS.KarpP. D.AltmanR. B. (2004). Computational analysis of Plasmodium falciparum metabolism: organizing genomic information to facilitate drug discovery.Genome Res.14917924. 10.1101/gr.2050304

  • 99

    YuJ.VavrusaM.AndreaniJ.ReyJ.TufféryP.GueroisR. (2016). InterEvDock: a docking server to predict the structure of protein–protein interactions using evolutionary information.Nucleic Acids Res.44W542W549. 10.1093/nar/gkw340

  • 100

    YueP.MelamudE.MoultJ. (2006). SNPs3D: candidate gene and SNP selection for association studies.BMC Bioinformatics7:166. 10.1186/1471-2105-7-166

  • 101

    ZhangL.YuG.XiaD.WangJ. (2019). Protein–protein interactions prediction based on ensemble deep neural networks.Neurocomputing3241019. 10.1016/j.neucom.2018.02.097

  • 102

    ZhouG.ChenM.JuC. J. T.WangZ.JiangJ.-Y.WangW. (2020). Mutation effect estimation on protein–protein interactions using deep contextualized representation learning.NAR Genom Bioinform.2. 10.1093/nargab/lqaa015

  • 103

    ZhouH.SkolnickJ. (2011). GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction.Biophys. J.10120432052. 10.1016/j.bpj.2011.09.012

Summary

Keywords

Mycobacterium leprae, amino acid substitution, chokepoint reactions, drug binging sites, homology (comparative) modeling, protein interface

Citation

Vedithi SC, Malhotra S, Acebrón-García-de-Eulate M, Matusevicius M, Torres PHM and Blundell TL (2021) Structure-Guided Computational Approaches to Unravel Druggable Proteomic Landscape of Mycobacterium leprae. Front. Mol. Biosci. 8:663301. doi: 10.3389/fmolb.2021.663301

Received

02 February 2021

Accepted

12 April 2021

Published

07 May 2021

Volume

8 - 2021

Edited by

Alexandre G. De Brevern, INSERM U1134 Biologie Intégrée du Globule Rouge, France

Reviewed by

Stéphane Téletchéa, Université de Nantes, France; Jeremy Esque, Institut Biotechnologique de Toulouse (INSA), France

Updates

Copyright

*Correspondence: Sundeep Chaitanya Vedithi, Tom L. Blundell,

These authors have contributed equally to this work

This article was submitted to Biological Modeling and Simulation, a section of the journal Frontiers in Molecular Biosciences

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics