VTR: A Web Tool for Identifying Analogous Contacts on Protein Structures and Their Complexes

Pimentel, Vitor; Mariano, Diego; Cantão, Letícia Xavier Silva; Bastos, Luana Luiza; Fischer, Pedro; de Lima, Leonardo Henrique Franca; Fassio, Alexandre Victor; Melo-Minardi, Raquel Cardoso de

doi:10.3389/fbinf.2021.730350

ORIGINAL RESEARCH article

Front. Bioinform., 08 November 2021
Sec. Data Visualization
Volume 1 - 2021 | https://doi.org/10.3389/fbinf.2021.730350

VTR: A Web Tool for Identifying Analogous Contacts on Protein Structures and Their Complexes

Vitor Pimentel¹^†

Diego Mariano¹^†

Letícia Xavier Silva Cantão¹

Luana Luiza Bastos¹ www.frontiersin.org

Pedro Fischer²

Leonardo Henrique Franca de Lima² www.frontiersin.org

Alexandre Victor Fassio¹

Raquel Cardoso de Melo-Minardi¹*

¹Laboratory of Bioinformatics and Systems, Department of Computer Science, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
²Laboratory of Molecular Modelling and Bioinformatics (LAMMB), Department of Physical and Biological Sciences, Universidade Federal de São João Del-Rei, Sete Lagoas, Brazil

Evolutionarily related proteins can present similar structures but very dissimilar sequences. Hence, understanding the role of the inter-residues contacts for the protein structure has been the target of many studies. Contacts comprise non-covalent interactions, which are essential to stabilize macromolecular structures such as proteins. Here we show VTR, a new method for the detection of analogous contacts in protein pairs. The VTR web tool performs structural alignment between proteins and detects interactions that occur in similar regions. To evaluate our tool, we proposed three case studies: we 1) compared vertebrate myoglobin and truncated invertebrate hemoglobin; 2) analyzed interactions between the spike protein RBD of SARS-CoV-2 and the cell receptor ACE2; and 3) compared a glucose-tolerant and a non-tolerant β-glucosidase enzyme used for biofuel production. The case studies demonstrate the potential of VTR for the understanding of functional similarities between distantly sequence-related proteins, as well as the exploration of important drug targets and rational design of enzymes for industrial applications. We envision VTR as a promising tool for understanding differences and similarities between homologous proteins with similar 3D structures but different sequences. VTR is available at http://bioinfo.dcc.ufmg.br/vtr.

Introduction

Proteins are macromolecules responsible for most functions in living beings, such as transport, immune protection, control growth, and so on (de Melo et al., 2006). The function of a protein is directly related to its three-dimensional structure. Previous studies have demonstrated that structural changes are related to sequence changes (Chothia and Lesk, 1986). Even small modifications, such as mutations, insertions, or deletions, can change the structure (Almassy and Dickerson, 1978; Lesk and Chothia, 1980; Lesk and Chothia, 1982). However, evolutionarily related proteins may present similar structures but different sequences (Gan et al., 2002). Sequence alignments can unambiguously distinguish similar and non-similar structures when the identity is over 40%. Even sequences with identities of 20–35% may generate false negatives for homology identification (Rost, 1999). Also, studies have reported similarities in structures with sequence identities lower than 20% (Chothia and Lesk, 1986). Until now, the understanding of how the polypeptide sequences fold into a particular three-dimensional shape after synthesis remains a mystery (Science, 2005; Upadhyay, 2019). It has motivated the search for computational algorithms to predict protein structures from their sequences (Dill et al., 2008; Dill and MacCallum, 2012) or even detect and annotate protein functions correctly (Veloso et al., 2007; Franciscani et al., 2014; Silveira et al., 2014). Besides, understanding protein structures and their interactions accurately is crucial to molecule’s rational design for several applications, including discovering novel drugs and improving enzymes for the biotechnological industry (Kuntz, 1992; Barroso et al., 2020).

Thus, understanding the role of inter-residues contacts in protein folding, stabilization, and function has been the goal of several studies (Melo et al., 2007; Silva et al., 2019). Contacts are weak and potentially stabilizing interactions in the structure of macromolecules, such as proteins (Martins et al., 2018; Silva et al., 2019). They can be hydrophobic interactions, electrostatic attractive or repulsive interactions, disulfide bonds, aromatic stacking, hydrogen bonds, and so on Sobolev et al. (1999), Neshich et al. (2003), Mancini et al. (2004), Melo et al. (2007). Contacts also have been used to compare protein structures, for instance, from contact maps (de Melo et al., 2006) or graph-based structural signatures (Pires et al., 2013). Recent computational approaches have suggested that substituting non-interacting residues for interacting ones can improve protein stability, highlighting the importance of computation of contacts (Barroso et al., 2020). In addition, pairwise comparisons between proteins can be performed using visual and structural alignment tools, such as PyMOL (Schrödinger, 2015). However, comparisons between contacts are usually performed individually, which makes comparisons between a considerable number of contacts toilsome. To the best of our knowledge, there is no tool for systematic structural comparisons between contacts in a protein pair.

Therefore, herein, we propose a novel approach to detect and align contacts for protein structure analysis and pairwise comparison. Our algorithm aims to detect differences and similarities in amino acid residues pairs in contact compared to analogous positions. For this purpose, we first perform a structural superposition between two proteins, detect contacts through a cutoff-based approach, and compare contacts in analogous positions using a score. We also developed a user-friendly web tool called VTR to facilitate the use of our method. Finally, as a proof of concept, we propose three case studies: 1) a comparison between a myoglobin (PDB ID: 1a6m) and a hemoglobin (PDB ID: 1dlw), proteins with similar structure but low sequence identity (18%); 2) a comparison of interactions among the spike protein RBD of SARS-CoV-2, SARS-CoV, and the cell receptor ACE2; and 3) a comparison between a glucose-tolerant β-glucosidase enzyme efficient for biofuel production (Bgl1A) and a less efficient non-tolerant β-glucosidase (Bgl1B).

Materials and Methods

The VTR algorithm receives as input two PDB (Protein Data Bank) (Berman et al., 2000; wwPDB consortium, 2019) files and processes them in the back-end using in-house scripts. The files are analyzed in three steps: 1) structural superposition; 2) contacts computation; and 3) search for contact matches.

Structural Superposition

VTR performs structural alignments between protein pairs using the default parameters of the TM-align algorithm (Zhang and Skolnick, 2005). TM-align receives as input two PDB files and returns the coordinates of a superimposed structure. TM-align will return the best alignment possible, and VTR will return a warning informing if contact matches could not be found.

Contact Computation

We compute the Euclidean distance among all-atom coordinates using in-house Python scripts. The tool identifies five types of possible interactions: hydrophobic, attractive/repulsive, hydrogen bonds, aromatic stacking, and disulfide bonds. A pair of residues are in contact if any atom pair meets the cutoff ranges presented in Table 1.

TABLE 1

TABLE 1. Cutoff distances for each contact type with cutoff values obtained from Sobolev et al. (1999), Neshich et al. (2003), de Melo et al. (2006), Bickerton et al. (2011), Fassio et al. (2019).

Contact must occur between atoms of two different residues. The atoms involved are described in Supplementary Table S1. For the detection of aromatic stacking, we determined the centroids of the rings (cutoff 2–6 Å). For phenylalanine and tyrosine, we calculate the median coordinate between the atoms CG and CZ for determining the ring centroid. For tryptophan, we used the median coordinate between the atoms CD2 and CE2. For histidine, we determined the median coordinate of the atoms CG, ND1, CE1, NE2, and CD2.

By default, VTR ignores contacts between atoms of the main chain of neighborhood residues (until reaching four positions) to remove contacts that compose the structures of α-helices. However, through the web tool, users can enable the detection of these contacts (increase processing time).

Search for Contact Matches

We used in-house scripts to compare the contacts in analogous positions. We proposed the AVD (average vector distance) metric to measure the distance between contacts and detect contact matches. AVD is calculated by Eq. 1:

AVD (P, Q) = \min (\frac{D (p 1, q 1) + D (p 2, q 2)}{2}, \frac{D (p 1, q 2) + D (p 2, q 1)}{2}) (1)

where P represents the contact between atoms p1 and p2 of protein A; Q represents the contact between the atoms q1 and q2 of protein B; and D is a function that returns the Euclidean distance between atomic coordinates. To detect a contact match between P and Q, the AVD(P, Q) should be the lowest value. We determine the VTR score based on Eq. 2:

VTR (A, B) = \frac{\sum_{i = 1}^{n} AV D_{i}}{Cn} \frac{(d_{A} + d_{B})}{(c_{A} + c_{B})} (2)

where A and B are the proteins analyzed; n is the number of matches found; C is the cutoff determined for the AVD; $d_{A}$ and $d_{B}$ are the number of contacts without match in both proteins; $c_{A}$ and $c_{B}$ are the number of contacts found in each protein. The VTR score is a value between 0 and 1, where the lower the value, the more similar are two proteins in terms of inter-residue contacts.

Web-Based Tool

The VTR web tool was developed using CodeIgniter (https://codeigniter.com/), D3 (https://d3js.org), jQuery (https://jquery.com), DataTables (https://datatables.net), and Bootstrap CSS and JavaScript library (https://getbootstrap.com). 3D structure visualizations are presented using 3Dmol.js (Rego and Koes, 2015).

Case Studies Details

For case study 1, we collected the PDBs entries of sperm whale myoglobin (PDB ID: 1a6m) (Vojtechovský et al., 1999) and the Paramecium caudatum single-chain and truncated hemoglobin (PDB ID: 1dlw) (Pesce et al., 2000). For the second case study, we selected SARS-CoV-2 (PDB ID: 6m0j) (Lan et al., 2020) and SARS-CoV (PDB ID: 2ajf) (Li et al., 2005) structures in the RCSB PDB. Each PDB file contains two chains A and E, where A represents the cell receptor ACE2 and E the receptor-binding domain (RBD) portion of the virus.

For the third case study, we selected the sequences of the glucose-tolerant GH1 β-glucosidase of a South China Sea metagenome (Bgl1A; UniProt ID: D5KX75) (Fang et al., 2010) and the non-tolerant GH1 β-glucosidase of a South China Sea metagenome (Bgl1B; UniProt ID: D0VEC8) (Fang et al., 2009) from Glutantbase (Mariano et al., 2020). We also constructed two mutants, H57D (Bgl1A) and D57H (Bgl1B), to evaluate VTR’s ability to propose mutations for enzymes based on differences of contacts (Supplementary Tables S5–S6). The 3D structures of wild (Bgl1A and Bgl1B) and mutant (Bgl1A: H57D and Bgl1B: D57H) structures were constructed using SWISS-MODEL (Arnold et al., 2006; Biasini et al., 2014) (Supplementary Tables S7–S8). To evaluate the impact of mutations, we estimated the Poisson-Boltzmann surface area (PBSA) with the integrated use of the online versions of the H++, PDB2PQR, and the Adaptive Poisson-Boltzmann Solver (APBS) tools (Baker et al., 2001; Dolinsky et al., 2004; Unni et al., 2011; Anandakrishnan et al., 2012; Jurrus et al., 2018). Details were included in the supplementary material (Supplementary Text S1).

Results and Discussion

VTR Web Tool Workflow

VTR receives as input two PDB files (hereafter called A and B). We suggest that PDBs present similar folding, but if structures with different folds were used, VTR would try to perform the best structural alignment using the TM-align tool. It rotates and translates the coordinates of the protein A considering the alignment with B. VTR allows three search methods: 1) ALL, which calculates all interactions for both whole complexes; 2) SINGLE, which calculates contacts in a single chain for each protein and compares them (users must inform a target chain for each protein); and 3) PPI, which calculates protein–protein interactions in both complexes and then compares them (users must inform a chain pair interacting for each protein).

Then, VTR uses in-house Python scripts to calculate contacts: 1) hydrogen bonds; 2) salt bridge (hydrogen bonds and attractive); 3) ionic interaction (attractive); 4) repulsive; 5) hydrophobic interactions; 6) aromatic stacking; and 7) disulfide bonds. Then, VTR determines contacts in analogous positions using the AVD score (average distance between the coordinates of atoms in contact), performs comparative statistical, and returns the results for the VTR interface (Figures 1A–C).

FIGURE 1

FIGURE 1. VTR workflow. (A) VTR web tool receives as input two PDB files. (B) VTR pipeline analyzes the PDB files in three steps: 1) structural superposition between the PDB files using TM-align; 2) contacts calculation using cutoff definitions obtained from the literature; and 3) search for analogous contacts using AVD strategy. (C) VTR returns the contact matches, and users can use them in the web interface. Also, VTR determines the statistics of differences between amino acids in contact.

Case Studies

To evaluate the VTR tool, we proposed three pairwise comparative case studies: 1) eukaryotic myoglobin and a truncated non-vertebrate hemoglobin chain; 2) interactions among the cell receptor ACE2 and both SARS-CoV-2 and SARS-CoV; and 3) glucose-tolerant and non-tolerant β-glucosidases. Contact maps generated by VTR demonstrate similarities in the contact pattern between protein pairs in each case study. For instance, Figure 2A is more similar to Figures 2B–F. We aim with these case studies to demonstrate a simple use for VTR (case study 1) and show the tool’s effectiveness in finding shared and unshared contacts in two systems already described in the literature (case study 2). In the third case study, we propose a real use of VTR in detecting possible mutation sites for enhancing enzymes with biotechnological applications.

FIGURE 2

FIGURE 2. Contact maps for case studies 1 (A–B), 2 (C–D), and 3 (E–F). Each point represents a contact. Contacts of similar types are shown in the same colors. In the x-axes and y-axes are shown the residue numbers. (A) PDB ID: 1DLW; (B) PDB ID: 1A6M; (C) PDB ID: 2AJF; (D) PDB ID: 6M0J; (E) Bgl1A; (F) Bgl1B. For (C) and (D), we show only contact maps for chain (A). We consider all contacts for the determination of the contact maps, including contacts between atoms of the main chain of closer residues (such as those present in alpha-helices).

Case Study 1: Myoglobin and Hemoglobin

In the first case study, we performed the contacts alignment between sperm whale myoglobin (PDB ID: 1a6m) and truncated single-chain hemoglobin from Paramecium caudatum ciliated protozoan (PDB ID: 1dlw). Myoglobin and hemoglobin are both oxygen-binding proteins belonging to the widespread and distantly related globin family (Hardison, 2012). For this case study, the evaluated myoglobin structure (1a6m) was at the oxy state (i.e., oxygen bound), while the hemoglobin structure (1dlw) was at the de-oxy one (i.e., oxygen unbound). The 1a6m presents a sequence length of 151 amino acids, while 1dlw, as a typical non-vertebrate truncated hemoglobin chain, has a minor amino acid content, with just 116 residues. Both proteins present a similar folding but a sequence identity of only 18%. The literature has described those structures of homologous sequences with an identity lower than 20% may present large structural differences (Chothia and Lesk, 1986). However, the discrepancy about this typical strong relationship between sequence and folding similarity for the globin family has long been known. In fact, globins form a family substantially conserved in structural topology, despite the distant sequence relationship, being this one of the higher conundrums in biochemistry (Lesk and Chothia, 1980; Hardison, 2012). Hence, we believe these proteins to be potential targets for the comparison of analogous contacts using VTR.

After processing both structures with default parameters, VTR detected the following contacts for 1a6m: 85 hydrogen bonds, 81 attractive interactions, 32 repulsive interactions, nine aromatic stacking, and 364 hydrophobic interactions. We found the following contacts in 1dlw: 65 hydrogen bonds, 20 attractive interactions, one repulsive interaction, two aromatic stacking, and 221 hydrophobic interactions. However, we obtained only 13 main contact matches in analogous positions using a 2 Å AVD cutoff (Supplementary Table S2). The contact matches increase to 60 when considering conserved contact matches of hydrophobic type.

From the contacts predicted in analogous positions, we found 12 different types (Figure 3; Supplementary Tables S2–S4), such as V21-V26 and Q13-N43 (Figure 3A). For 1a6m, VTR detected an attractive contact between H36 and E38. However, it detected a hydrogen bond between D26 and T28 in analogous positions of 1dlw (Figure 3B). On the other hand, in 1a6m, I75 performs hydrophobic interactions with L86, while F48 performs the same type of interaction with V109 (Figure 3C). Interestingly, I75 and L86 are located at a distance of 11 amino acids, while F48 and V109 are at a distance of 61 positions. This highlights the VTR’s ability to detect contacts even in different sequence positions. Most of the contact matches are located at compatible distances of 2–4 amino acids, such as K102-F106 and A76-T80 (Figure 3D), L104-S108 and F78-I82 (Figure 3E), and L89-H93 and L64-H68 (Figure 3F). Only the hydrogen bond detected in the contact L89-H93 of 1a6m was considered a match with Q13-N43. This may suggest that both contacts present similar importance for the stability of both proteins.

FIGURE 3

FIGURE 3. Six analogous contacts between 1a6m (cyan sticks) and 1dlw (green sticks). (Hy) hydrophobic; (Hb) Hydrogen bond; (At) Attractive ionic interaction. (A) V21-V66 (1a6m) and Q13-N43 (1dlw); (B) H36-E38 (1a6m) and D26-T28 (1dlw); (C) I75-L89 (1dlw) and F48-L64 (1dlw); (D) K102-F106 (1a1m) and A76-T80 (1dlw); (E) L104-S108 (1a6m) and F78-I82 (1dlw); and (F) L89-H93 (1a6m) and L64-H68 (1dlw). Sticks were colored using the cyan/green color scheme.

It is essential to highlight that, for this analysis, we did not consider the interactions between atoms of the main chain of closer residues. We did it to reduce the number of contacts detected in alpha-helices, which can explain the low number of hydrogen bonds conserved. Enabling the advanced option “detection of structural contacts in α-helices,” the number of hydrogen bond contact matches increases to 129. This option is disabled by default because VTR desires to highlight conserved interactions with the side-chain atoms. Also, enabling this option increases the computational cost once that all contacts of similar secondary structures in close regions will be computed.

Case Study 2: Analyzing Interactions Between the Spike Protein RBD of SARS-CoV-2 and the Cell Receptor ACE2

Recently, a new coronavirus (SARS-CoV-2) was related to severe acute respiratory syndrome (COVID-19), spreading rapidly worldwide, and causing a pandemic situation (Zhou et al., 2020). A sequence comparison demonstrates that SARS-CoV-2 structures share approximately 80% of sequence identity with the SARS-CoV (Lan et al., 2020; Zhou et al., 2020). Like SARS-CoV, SARS-CoV2 recognizes the ACE2 (Angiotensin-Converting Enzyme 2) receptor in humans. Hence, understanding the binding mechanism of the spike RBD (Receptor-Binding Domain) of SARS-CoV-2 with ACE2 may help to shed some light on the mechanism of recognition of virus receptors and the initial infection process. In a recent study, Lan et al. (2020) identified the relevant residues to the interaction of SARS-CoV-2 with the receptor. It was noted that most of these residues are highly conserved and shared with SARS-CoV. Here, we verified the ability of VTR to find the known contacts between different chains of both structures. We compared the structures of SARS-CoV-2 spike RBD (PDB ID 6m0j) and SARS-CoV spike RBD (PDB ID 2ajf), both in complex with the ACE2 receptor. We maintain the standard 2 Å AVD cutoff, and we use chain A (ACE2) and chain E (RBD portion of the virus) for both structures.

Compared to a study by Lan et al. (2020), the VTR could find all 16 atomic contacts between SARS-CoV and ACE2 (3 salt bridges and 13 hydrogen bonds). VTR could also find 15 contacts between SARS-CoV-2 and ACE2 (2 salt bridges and 13 hydrogen bonds). Of these, VTR calculated and presented which contacts were shared between receptor and SARS-CoV or SARS-CoV-2. We detected all contacts described in Lan et al. (2020) (Table 2).

TABLE 2

TABLE 2. Analogous contacts between proteins with PDB entries SARS-CoV (6m0j) and 2ajf. R1: residue 1; R2: residue 2; HB: hydrogen bonds, HY: hydrophobic. In the last column, we highlighted lines where the matches were reported in Lan et al. (2020). Residues in contact, but atom interactions were not described in Lan et al. (2020), are presented as “−.”

VTR calculated six shared hydrogen bonds and one hydrophobic interaction between SARS-CoV and a receptor that are not present in Lan et al. (2020) (Table 2). Although the hydrophobic interaction between Y41 (CE2) and T487 (CG2) was not described in Lan et al. (2020), the hydrogen bond between Y41 and T487 was presented (Table 2; lines 1 and 2). VTR also detected a permuted interaction between atoms of D38 and Y449 (6m0j) and D38 and Y436 (2ajf) (Table 2; lines 3 and 7). Moreover, the interactions between G354 (O) and G502 (N) of 6m0j and their equivalents in 2ajf appear to be probably detected due to the cutoff-based strategy (Table 2; line 8). Also, VTR detected that G502 interacted with K353: interaction described in Lan et al. (2020). Among the conserved hydrogen bond contacts, four called our attention as they were described in the literature (Figures 4A–D).

FIGURE 4

FIGURE 4. Four analogous contacts between SARS-CoV and ACE2 (purple sticks), and SARS-CoV-2 and ACE2 (yellow sticks). VTR suggests that (A) H34-Y453 (6M0J) is equivalent to H34-Y440 (2AJF); (B) E37-Y505 (6M0J) is equivalent to E37-Y491 (2AJF); (C) F28-Y489 (6M0J) is equivalent to F28-Y475 (2AJF); and (D) Q42-Y449 (6M0J) is equivalent to Q42-Y436 (2AJF). Sticks were colored using the yellow/purple color scheme.

For SARS-CoV-2, VTR detected a hydrogen bond between H34 and Y453 that seems to be equivalent to H34 and Y440 in SARS-CoV (Figure 4A). The same applies to the following contact pairs: E37-Y505 (6M0J) and E37-Y491 (2AJF) (Figure 4B); F28-Y489 (6M0J) and F28-436 (2AJF) (Figure 4C); and Q42-Y449 (6M0J) and Q42-Y436 (2AJF) (Figure 4D). It is important to highlight that besides the interaction between E37 and Y505 (Figure 4B), the residue E37 possibly interacts with R403 in 6M0J, but there is no equivalent contact in 2AJF. Also, Lan et al. (2020) described an interaction between Y83 (OH) and Y489 (OH) of 6M0J. Besides this interaction, VTR also pointed out that Y489 (OH) interacts with the main-chain atoms of F28. It demonstrates that the cutoff-based strategy used by VTR also has advantages when compared to more restrictive methods. This possible interaction could, in future studies, be better comprehended through molecular dynamics experiments.

Case Study 3 - Glucose-Tolerant β-glucosidases

β-glucosidases (E.C. 3.2.1.21) are enzymes that act in the last step of the saccharification process, cleaving cellobiose into two molecules of glucose (Ketudat Cairns and Esen, 2010; Mariano et al., 2017). Hence, they are considered very important for the second-generation biofuel industrial applications (Bergmann et al., 2014; Costa et al., 2019; Limade et al., 2020). Besides, the literature has reported that most β-glucosidases are inhibited in high glucose concentrations (Teugjas and Väljamäe, 2013; de Giuseppe et al., 2014). Therefore, the design of enzymes more resistant to glucose inhibition has motivated research around the world (Salgado et al., 2018). Recently, two β-glucosidases extracted from the marine metagenome of the South China Sea were characterized and evaluated (Yang et al., 2015). The first one (Bgl1A) was able to keep its activity even in glucose concentrations up to 1,000 mM (Fang et al., 2010), being described as glucose-tolerant (a class of β-glucosidase enzymes with high potential for industrial use). On the other hand, the second one (Bgl1B) was inhibited in concentrations up to 50 mM (Fang et al., 2009). Both enzymes have a similar TIM-barrel folding, belonging to the first family of glycoside hydrolases (GH1) (Cantarel et al., 2009). Moreover, they present a sequence similarity higher than 50% (Mariano et al., 2019). Thus, we decided to submit to VTR the structures of Bgl1A and Bgl1B to evaluate similar contacts and identify possible differences.

VTR found 375 main contact matches (984 considering hydrophobics), an average RMSD (for contact matches) of 0.96, and a VTR score of 0.19. From 375 matches, 327 maintain the contact type, and 48 change the contact type. The matches that change the contact type are interesting targets for the detection of differences in structures and potentially can be used to suggest mutations. Thus, such information may guide a biotechnological industry at providing glucose tolerance characteristics to Bgl1B through the rational design of enzymes using site-directed mutagenesis.

After analyzing the results (available in the Supplementary material), one contact match caught our attention: D41-H57 of Bgl1A (Figure 5A) that corresponds to D41-D57 of Bgl1B (Figure 5B). While Bgl1A presents an attractive contact, Bgl1b presents a repulsive interaction. These contacts are located in the extremities of loop A. Furthermore, this loop has been reported in the literature as necessary for restricting the entrance and exit of the active site in glucose-tolerant β-glucosidases (Limade et al., 2020). Hence, changes in this region could modify the mobility of the protein and could explain the differences in the glucose tolerance of both enzymes.

FIGURE 5

FIGURE 5. Important contacts in analogous positions between Bgl1A and Bgl1B. (A) Attractive interaction D41-H57 in loop A of Bgl1A. (B) Repulsive interaction D41-D57 in loop A of Bgl1B. (A–B) Protein backbones are shown as grey cartoons. Amino acid residues are shown as cyan (Bgl1A) and green (Bgl1B) cartoons. In both proteins, D41 also interacts with R35. (C–D) Illustrative scheme of the importance of these contacts. (C) Bgl1A: H57 performs attractive interactions with D41. (D) Bgl1B: D41 performs repulsive interactions with D57.

The literature has reported implications of the topology and dynamics of loop A for the differences in glucose tolerance and inhibition between Bgl1A and Bgl1B (Yang et al., 2015; Limade et al., 2020). To probe the potential of our method for protein engineering, we have checked how considerable were the physical-chemical differences attributed by this point modification at loop A as a whole.

Firstly, we modeled in silico a mutant of Bgl1A (D57H) and a mutant of Bgl1B (H57D). We expected that Bgl1B’s mutant presented characteristics similar to Bgl1A (i.e., characteristics that could lead to glucose tolerance). As a control, we modeled a Bgl1A mutant that we expected to present features like Bgl1B (i.e., negative characteristics for biofuel production). Then, we inspected the mesoscopic influence of such single amino acid modification at the regional electrostatic surface by Poisson-Boltzmann analysis.

PBSA Points Substantial Electrostatic Differences

The estimation of the protonation states at pH 7.0 has recovered the usual HSE protonation for the H57 in Bgl1A (i.e., a neutral histidine protonated just at the ε nitrogen atom). This is consonant with the position of the side chain of this histidine in loop A relatively turned to the solvent, with just a marginal contact with the D41 residue at the opposite side. Also, the presence of the positively charged R35 prevents that the H57 local pKa be changed enough by the next D41 to induce a bi-protonation at this histidine (Figure 5). Hence, while the D41-D57 in Bgl1B can be classified as an electrically repulsive contact, the D41-H57 interaction in Bgl1A is not a charge to charge but a charge-dipole attractive interaction. Even so, the Poisson-Boltzmann surface analysis (PBSA) indicates a substantial difference between Bgl1A and Bgl1B in the electrostatic surface potential of loop A (Figures 6A,B).

FIGURE 6

FIGURE 6. Poisson-Boltzmann surface potential at the vicinity between positions 41 and 57 in Bgl1A, Bgl1B, and mutants. (A) Bg1lA; (B) Bgl1B; (C) Bgl1A’s H57D mutant; (D) Bgl1B’s D57H mutant. The PBSA colors are on a red: white: blue scale for electrostatic potentials around −10.00: −5.00: 0.00 k_bT/β unities (being k_b the Boltzmann constant, T the 300 K temperature, and β the electron coulomb charge). Residues 41 and 57 are depicted as spheres. Regions whose electrostatic potential is affected by the H57D or D57D permutations are depicted as “x” for Bgl1A or “*” for Bgl1B.

Indeed, all the topological context around loop A in Bgl1B accumulates a substantial amount of negative electrical potential (red region in Figure 6B). On the other hand, Bgl1A presents a surface with a more neutral electrical potential (blue and white regions in Figure 6A). While the D41-D57 contact appears to be a hot spot of negative charge accumulation in Bgl1B, the correspondent D41-H57 in Bgl1A shares similar neutrality of the surrounding region. The substitutions H57D (Bgl1A) and D57H (Bgl1B) introduce a point inversion of this behavior in both proteins (Figures 6C,D). Also, these permutations cause electrostatic modifications at specific points of this region. Comparing Figures 6A,C (Bgl1A and its mutant), we can observe an increase of red regions. Contrarily, comparing Figures 6B,D (Bgl1B and its mutant), we can observe a reduction of the red regions.

Furthermore, the sites with local potential affected by the permutations appear to be the same in Bgl1A and Bgl1B, spread around loop A. The dynamics and topology of this region are strongly correlated to the functional differences between both proteins (Yang et al., 2015; Limade et al., 2020). In addition, the charge inversions in this region have been correlated to changes in activity and stability (González-Blasco et al., 2000; Tamaki et al., 2014). All these factors led us to look for a glimpse of the influence of the D41-H57D permutation in the vibrational dynamics of loop A.

Vibrational Modulation of Loop A

The representativity of the eigenvectors (Supplementary Figure S1) demonstrates the differences between the PCA recovered fluctuation between loops A of Bgl1A and Bgl1B. Bgl1A fluctuation is more homogeneously represented by the two first eigenvectors (PCs) of loop A, with a fractional eigenvalue of 63% for PC1 and 29% for PC2. On the other hand, the fluctuations at the Bgl1B’s loop A are strongly located at PC1, that alone accounts for 82% of the sum of all eigenvalues. This can indicate that a single kind of movement, with a substantial fugacity from the middle structure at the minima, accounts for most of the vibratory mobility of Bgl1B’s loop A. This is consonant with a local instability caused by the highly repulsive environment at this loop (Figure 6B).

In Supplementary Figure S1, we can see higher proximity among the lines that represent wild Bgl1B, Bgl1B’s mutant, and Bgl1A’s mutant. The loop A of the wild Bgl1A presents a more equilibrated and distributed fluctuation between different modes (in concurrence with the more neutral profile in Figure 6A). In addition, while the H57D exchange is enough to invert the vibrational eigenvalue distribution of the Bgl1A’s loop A to the Bgl1B pattern, the same does not occur for Bgl1B with the D57H permutation (Supplementary Figure S1A). Hence, the more neutralized electrostatic surface for Bgl1A (Supplementary Figure S1A) is sensitive to the point insertion of a repulsive contact at the loop A basis. On the other hand, the strongly negatively charged Bgl1B’s loop A is less responsive to neutralization at this single point.

Furthermore, the analysis of the distribution of the fluctuations allocated at the two first eigenvectors (PC1+2) of loop A also shows considerable differences (Supplementary Figures S1B–F). Bgl1A presents higher apparent vibrational mobility concentrated at the C-terminal portion of the N-terminal helix of this loop (residues 45–50). In contrast, Bgl1B has its high mobility equally spread along the entire medial portion of loop A (residues 45–55). The different electrostatic contexts (Figures 6A,B) may explain the vibrational distinctions (Supplementary Figures S1A–B). Despite this, the impact in contact patterns, caused by changes in position 57, promotes at least qualitatively standardized modifications around half of loop A (region indicated by lines in Supplementary Figures S1C–F). The presence of a negatively charged amino acid in position 57 promotes, both in Bgl1A and Bgl1B’s environments, an increase in mobility of positions 44, 50, 51, and 57 itself. It also promotes a reduction in the mobility of the region between residues 45 to 49.

For the D41-D57 repulsive contact, the vibrational movement of both acids is unbalanced, with one moving more or faster than the other (“*” in Supplementary Figures S1D–E). On the other hand, the movement involving the D41-H57 contact is more balanced (mainly for Bgl1A’s context), indicating a more equilibrated vibration again.

Implications for Protein Engineering

The intensity of some of these modifications or changes at the rest of loop A seems to be context-dependent. This agrees with the fact that the permutation of the entire loop A between Bgl1A and Bgl1B was able to revert the glucose tolerant/inhibited behavior, but with poor results for local single amino acid substitutions (Yang et al., 2015). Despite this, a single substitution detected by VTR was able to promote vibrational changes with the same pattern at almost half of the extension of this functionally crucial loop.

All these mentioned facts illustrate a potential use for this tool. If properly combined with electrostatic and mobility patterns detection techniques, such as the APBS and molecular mechanics/PCA here employed, VTR can be useful for rational protein engineering. The integrated use of VTR with computational or experimental tools can be promising to find the topologically minimum modifications that need to be carried at proteins to introduce some desirable characteristics found in other homologous.

For GH1 β-glucosidases, a protein class extensively used in industry and with a strong interest in the second-generation bioethanol production (Mariano et al., 2017), this approach shows itself to be encouraging. Overall, minimal topological modifications represent a promising strategy for suggesting mutations, especially in the beta-glucosidase loop regions that surround the active site. The topology and dynamics of these loops can allow or restrict movements involved in glucose entrance and exit (i.e., glucose tolerance) (Yang et al., 2015; Costa et al., 2019; Konar et al., 2019; Limade et al., 2020) or also can affect the thermostability (González-Blasco et al., 2000; Tamaki et al., 2014; Konar et al., 2019). These are both examples of industrially desirable characteristics for these and other proteins.

Conclusion

Herein, we presented VTR, a novel approach with a visual web interface that can be used to analyze, compare, and scrutinize analogous contacts in protein pairs. We explored the tool features using three case studies, where we demonstrated the potential of VTR to shed some light on the mechanisms of topology conservation on phylogenetically related but sequentially distant proteins. We also evaluated contact similarities and dissimilarities on pharmacologically relevant targets. We suggested the use of our tool to the rational design of proteins with biotechnological applications. Concerning the second case study, both the confirmation of contacts already reported in the literature and the finding of four hydrogen bond matches still not described are promising finds for the future rational design of anti-Sars-Cov-2 drugs. In the last case study, we compared a glucose-tolerant beta-glucosidase enzyme with a non-tolerant one. VTR detected several changes in contact types of analogous positions. Called our attention a change in an attractive contact by a repulsive: D41-H57 of Bgl1A that corresponds to D41-D57 of Bgl1B. We explored the importance of this contact using molecular mechanics minimization and vibrational inspection by principal component analysis. Our results demonstrate that the presence of this contact is vital for the stability of Bgl1A. However, Bgl1B’s mutant with a similar interaction was not enough to present similar characteristics of the glucose-tolerant one on a substantial extension. Nevertheless, this case study illustrates the potential of the VTR tool for the rational design of industrial enzymes and gives some glimpses about electrostatic and vibrational aspects of β-glucosidase enzymes. We hope VTR can be used for understanding differences and similarities between homologous proteins with similar 3D structures but differences in sequences. VTR is available at http://bioinfo.dcc.ufmg.br/vtr.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Author Contributions

VP and DM developed the web tool and performed the first case study. LC and LB performed the second case study. LL and PF performed APBS and molecular dynamics analysis for the third case study. DM wrote the manuscript. VP, PF, LL, AF, and RM revised the manuscript. Project design and funding acquisition: RM. All authors read and approved the manuscript.

Funding

This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001 (project grant number 51/2013 - 23038.004007/2014-82), and by FAPEMIG - Fundação de Amparo à Pesquisa do Estado de Minas Gerais.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

The authors thank the funding agencies: Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES), Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG), and Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq).

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbinf.2021.730350/full#supplementary-material

References

Almassy, R. J., and Dickerson, R. E. (1978). Pseudomonas Cytochrome C551 at 2.0 A Resolution: Enlargement of the Cytochrome C Family. Proc. Natl. Acad. Sci. U S A. 75, 2674–2678. doi:10.1073/pnas.75.6.2674

PubMed Abstract | CrossRef Full Text | Google Scholar

Anandakrishnan, R., Aguilar, B., and Onufriev, A. V. (2012). H++ 3.0: Automating pK Prediction and the Preparation of Biomolecular Structures for Atomistic Molecular Modeling and Simulations. Nucleic Acids Res. 40, W537–W541. doi:10.1093/nar/gks375

PubMed Abstract | CrossRef Full Text | Google Scholar

Arnold, K., Bordoli, L., Kopp, J., and Schwede, T. (2006). The SWISS-MODEL Workspace: A Web-Based Environment for Protein Structure Homology Modelling. Bioinformatics 22, 195–201. doi:10.1093/bioinformatics/bti770

PubMed Abstract | CrossRef Full Text | Google Scholar

Baker, N. A., Sept, D., Joseph, S., Holst, M. J., and McCammon, J. A. (2001). Electrostatics of Nanosystems: Application to Microtubules and the Ribosome. Proc. Natl. Acad. Sci. U S A. 98, 10037–10041. doi:10.1073/pnas.181342398

PubMed Abstract | CrossRef Full Text | Google Scholar

Barroso, J. R. M. S., Mariano, D., Dias, S. R., Rocha, R. E. O., Santos, L. H., Nagem, R. A. P., et al. (2020). Proteus: An Algorithm for Proposing Stabilizing Mutation Pairs Based on Interactions Observed in Known Protein 3D Structures. BMC Bioinformatics 21, 275. doi:10.1186/s12859-020-03575-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Bergmann, J. C., Costa, O. Y., Gladden, J. M., Singer, S., Heins, R., D'haeseleer, P., et al. (2014). Discovery of Two Novel β-glucosidases from an Amazon Soil Metagenomic Library. FEMS Microbiol. Lett. 351, 147–155. doi:10.1111/1574-6968.12332

PubMed Abstract | CrossRef Full Text | Google Scholar

Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., Weissig, H., et al. (2000). The Protein Data Bank. Nucleic Acids Res. 28, 235–242. doi:10.1093/nar/28.1.235

PubMed Abstract | CrossRef Full Text | Google Scholar

Biasini, M., Bienert, S., Waterhouse, A., Arnold, K., Studer, G., Schmidt, T., et al. (2014). SWISS-MODEL: Modelling Protein Tertiary and Quaternary Structure Using Evolutionary Information. Nucleic Acids Res. 42, W252–W258. doi:10.1093/nar/gku340

PubMed Abstract | CrossRef Full Text | Google Scholar

Bickerton, G. R., Higueruelo, A. P., and Blundell, T. L. (2011). Comprehensive, Atomic-Level Characterization of Structurally Characterized Protein-Protein Interactions: The PICCOLO Database. BMC Bioinformatics 12, 313. doi:10.1186/1471-2105-12-313

PubMed Abstract | CrossRef Full Text | Google Scholar

Cantarel, B. L., Coutinho, P. M., Rancurel, C., Bernard, T., Lombard, V., and Henrissat, B. (2009). The Carbohydrate-Active EnZymes Database (CAZy): An Expert Resource for Glycogenomics. Nucleic Acids Res. 37, D233–D238. doi:10.1093/nar/gkn663

PubMed Abstract | CrossRef Full Text | Google Scholar

Chothia, C., and Lesk, A. M. (1986). The Relation between the Divergence of Sequence and Structure in Proteins. EMBO J. 5, 823–826. doi:10.1002/j.1460-2075.1986.tb04288.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Costa, L. S. C., Mariano, D. C. B., Rocha, R. E. O., Kraml, J., Silveira, C. H. D., Liedl, K. R., et al. (2019). Molecular Dynamics Gives New Insights into the Glucose Tolerance and Inhibition Mechanisms on β-Glucosidases. Molecules 24, 3215. doi:10.3390/molecules24183215

PubMed Abstract | CrossRef Full Text | Google Scholar

de Giuseppe, P. O., Souza, Tde. A., Souza, F. H., Zanphorlin, L. M., Machado, C. B., Ward, R. J., et al. (2014). Structural Basis for Glucose Tolerance in GH1 β-Glucosidases. Acta Crystallogr. D Biol. Crystallogr. 70, 1631–1639. doi:10.1107/S1399004714006920

PubMed Abstract | CrossRef Full Text | Google Scholar

de Melo, R. C., Lopes, C. E., Fernandes, F. A., da Silveira, C. H., Santoro, M. M., Carceroni, R. L., et al. (2006). A Contact Map Matching Approach to Protein Structure Similarity Analysis. Genet. Mol. Res. 5, 284–308.

PubMed Abstract | Google Scholar

Dill, K. A., and MacCallum, J. L. (2012). The Protein-Folding Problem, 50 Years on. Science 338, 1042–1046. doi:10.1126/science.1219021

PubMed Abstract | CrossRef Full Text | Google Scholar

Dill, K. A., Ozkan, S. B., Shell, M. S., and Weikl, T. R. (2008). The Protein Folding Problem. Annu. Rev. Biophys. 37, 289–316. doi:10.1146/annurev.biophys.37.092707.153558

PubMed Abstract | CrossRef Full Text | Google Scholar

Dolinsky, T. J., Nielsen, J. E., McCammon, J. A., and Baker, N. A. (2004). PDB2PQR: An Automated Pipeline for the Setup of Poisson-Boltzmann Electrostatics Calculations. Nucleic Acids Res. 32, W665–W667. doi:10.1093/nar/gkh381

PubMed Abstract | CrossRef Full Text | Google Scholar

Fang, W., Fang, Z., Liu, J., Hong, Y., Peng, H., Zhang, X., et al. (2009). Cloning and Characterization of a Beta-Glucosidase from marine Metagenome. Sheng Wu Gong Cheng Xue Bao 25, 1914–1920.

PubMed Abstract | Google Scholar

Fang, Z., Fang, W., Liu, J., Hong, Y., Peng, H., Zhang, X., et al. (2010). Cloning and Characterization of a Beta-Glucosidase from Marine Microbial Metagenome with Excellent Glucose Tolerance. J. Microbiol. Biotechnol. 20, 1351–1358. doi:10.4014/jmb.1003.03011

CrossRef Full Text | Google Scholar

Fassio, A. V., Santos, L. H., Silveira, S. A., Ferreira, R. S., and de Melo-Minardi, R. C. (2019). nAPOLI: A Graph-Based Strategy to Detect and Visualize Conserved Protein-Ligand Interactions in Large-Scale. Ieee/acm Trans. Comput. Biol. Bioinf. 17 (4), 1317–1328. doi:10.1109/TCBB.2019.2892099

PubMed Abstract | CrossRef Full Text | Google Scholar

Franciscani, G., Rodrygo, L. T. S., Raphael, O., João, P. P., Meira, W., and Melo-Minardi, R. (2014). An Annotation Process for Data Visualization Techniques, in Proceedings of International Conference on Data Analytics. Available at https://homepages.dcc.ufmg.br/∼rapha/papers/anotationProcessIARIA14.pdf

Google Scholar

Gan, H. H., Perlow, R. A., Roy, S., Ko, J., Wu, M., Huang, J., et al. (2002). Analysis of Protein Sequence/Structure Similarity Relationships. Biophys. J. 83, 2781–2791. doi:10.1016/s0006-3495(02)75287-9

PubMed Abstract | CrossRef Full Text | Google Scholar

González-Blasco, G., Sanz-Aparicio, J., González, B., Hermoso, J. A., and Polaina, J. (2000). Directed Evolution of Beta -glucosidase A from Paenibacillus Polymyxa to thermal Resistance. J. Biol. Chem. 275, 13708–13712. doi:10.1074/jbc.275.18.13708

CrossRef Full Text | Google Scholar

Hardison, R. C. (2012). Evolution of Hemoglobin and its Genes. Cold Spring Harb. Perspect. Med. 2, a011627. doi:10.1101/cshperspect.a011627

PubMed Abstract | CrossRef Full Text | Google Scholar

Jurrus, E., Engel, D., Star, K., Monson, K., Brandi, J., Felberg, L. E., et al. (2018). Improvements to the APBS Biomolecular Solvation Software Suite. Protein Sci. 27, 112–128. doi:10.1002/pro.3280

PubMed Abstract | CrossRef Full Text | Google Scholar

Ketudat Cairns, J. R., and Esen, A. (2010). β-Glucosidases. Cell. Mol. Life Sci. 67, 3389–3405. doi:10.1007/s00018-010-0399-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Konar, S., Sinha, S. K., Datta, S., and Ghorai, P. K. (2019). Probing the Effect of Glucose on the Activity and Stability of β-Glucosidase: An All-Atom Molecular Dynamics Simulation Investigation. ACS Omega 4, 11189–11196. doi:10.1021/acsomega.9b00509

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuntz, I. D. (1992). Structure-based Strategies for Drug Design and Discovery. Science 257, 1078–1082. doi:10.1126/science.257.5073.1078

PubMed Abstract | CrossRef Full Text | Google Scholar

Lan, J., Ge, J., Yu, J., Shan, S., Zhou, H., Fan, S., et al. (2020). Structure of the SARS-CoV-2 Spike Receptor-Binding Domain Bound to the ACE2 Receptor. Nature 581, 215–220. doi:10.1038/s41586-020-2180-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Lesk, A. M., and Chothia, C. (1982). Evolution of Proteins Formed by Beta-Sheets. II. The Core of the Immunoglobulin Domains. J. Mol. Biol. 160, 325–342. doi:10.1016/0022-2836(82)90179-6

CrossRef Full Text | Google Scholar

Lesk, A. M., and Chothia, C. (1980). How Different Amino Acid Sequences Determine Similar Protein Structures: the Structure and Evolutionary Dynamics of the Globins. J. Mol. Biol. 136, 225–270. doi:10.1016/0022-2836(80)90373-3

CrossRef Full Text | Google Scholar

Li, F., Li, W., Farzan, M., and Harrison, S. C. (2005). Structure of SARS Coronavirus Spike Receptor-Binding Domain Complexed with Receptor. Science 309, 1864–1868. doi:10.1126/science.1116480

PubMed Abstract | CrossRef Full Text | Google Scholar

Limade, L. H. F., Fernandez-Quintéro, M. L., Rocha, R. E. O., Mariano, D. C. B., de Melo-Minardi, R. C., and Liedl, K. R. (2020). Conformational Flexibility Correlates with Glucose Tolerance for point Mutations in β-glucosidases - a Computational Study. J. Biomol. Struct. Dyn. 0, 1–14. doi:10.1080/07391102.2020.1734484

CrossRef Full Text | Google Scholar

Mancini, A. L., Higa, R. H., Oliveira, A., Dominiquini, F., Kuser, P. R., Yamagishi, M. E., et al. (2004). STING Contacts: a Web-Based Application for Identification and Analysis of Amino Acid Contacts within Protein Structure and across Protein Interfaces. Bioinformatics 20, 2145–2147. doi:10.1093/bioinformatics/bth203

PubMed Abstract | CrossRef Full Text | Google Scholar

Mariano, D., Pantuza, N., Santos, L. H., Rocha, R. E. O., de Lima, L. H. F., Bleicher, L., et al. (2020). Glutantβase: A Database for Improving the Rational Design of Glucose-Tolerant β-Glucosidases. BMC Mol. Cel Biol. 21, 50. doi:10.1186/s12860-020-00293-y

CrossRef Full Text | Google Scholar

Mariano, D. C. B., Leite, C., Santos, L. H. S., Marins, L. F., Machado, K. S., Werhli, A. V., et al. (2017). Characterization of Glucose-Tolerant β-glucosidases Used in Biofuel Production under the Bioinformatics Perspective: A Systematic Review. Genet. Mol. Res. 16, 1–19. doi:10.4238/gmr16039740

CrossRef Full Text | Google Scholar

Mariano, D. C. B., Santos, L. H., Machado, K. D. S., Werhli, A. V., de Lima, L. H. F., and de Melo-Minardi, R. C. (2019). A Computational Method to Propose Mutations in Enzymes Based on Structural Signature Variation (SSV). Int. J. Mol. Sci. 20, 333. doi:10.3390/ijms20020333

PubMed Abstract | CrossRef Full Text | Google Scholar

Martins, P. M., Mayrink, V. D., de A. Silveira, S., da Silveira, C. H., de Lima, L. H. F., and de Melo- Minardi, R. C. (2018). “How to Compute Protein Residue Contacts More Accurately,” in Proceedings of the 33rd Annual ACM Symposium on Applied Computing, Pau, France, April 9, 2018 (ACM), 60–67. doi:10.1145/3167132.3167136

CrossRef Full Text | Google Scholar

Melo, R., Ribeiro, C., Murray, C. S., Veloso, C. J. M., Silveira, C. H. D., Neshich, G., et al. (2007). Finding Protein-Protein Interaction Patterns by Contact Map Matching. Genet. Mol. Res. 6 (4), 946–963.

PubMed Abstract | Google Scholar

Neshich, G., Togawa, R. C., Mancini, A. L., Kuser, P. R., Yamagishi, M. E., Pappas, G., et al. (2003). STING Millennium: A Web-Based Suite of Programs for Comprehensive and Simultaneous Analysis of Protein Structure and Sequence. Nucleic Acids Res. 31, 3386–3392. doi:10.1093/nar/gkg578

PubMed Abstract | CrossRef Full Text | Google Scholar

Pesce, A., Couture, M., Dewilde, S., Guertin, M., Yamauchi, K., Ascenzi, P., et al. (2000). A Novel Two-Over-Two Alpha-Helical Sandwich Fold Is Characteristic of the Truncated Hemoglobin Family. EMBO J. 19, 2424–2434. doi:10.1093/emboj/19.11.2424

PubMed Abstract | CrossRef Full Text | Google Scholar

Pires, D. E. V., de Melo-Minardi, R. C., Campos, F. F., Silveira, C. H., Meira, W., et al. (2013). aCSM: Noise-Free Graph-Based Signatures to Large-Scale Receptor-Based Ligand Prediction. Bioinformatics 29, 855–861. doi:10.1093/bioinformatics/btt058

PubMed Abstract | CrossRef Full Text | Google Scholar

Rego, N., and Koes, D. (2015). 3Dmol.js: Molecular Visualization with WebGL. Bioinformatics 31, 1322–1324. doi:10.1093/bioinformatics/btu829

PubMed Abstract | CrossRef Full Text | Google Scholar

Rost, B. (1999). Twilight Zone of Protein Sequence Alignments. Protein Eng. 12, 85–94. doi:10.1093/protein/12.2.85

PubMed Abstract | CrossRef Full Text | Google Scholar

Salgado, J. C. S., Meleiro, L. P., Carli, S., and Ward, R. J. (2018). Glucose Tolerant and Glucose Stimulated β-glucosidases - A Review. Bioresour. Technol. 267, 704–713. doi:10.1016/j.biortech.2018.07.137

PubMed Abstract | CrossRef Full Text | Google Scholar

Schrödinger, L. L. C. (2015). The PyMOL Molecular Graphics System. Version 1.8. Available at https://pymol.org/2/support.html.

Google Scholar

Science, A. A. (2005). So Much More to Know. Science 309, 78–102. doi:10.1126/science.309.5731.78b

PubMed Abstract | CrossRef Full Text | Google Scholar

Silva, M. F. M., Martins, P. M., Mariano, D. C. B., Santos, L. H., Pastorini, I., Pantuza, N., et al. (2019). Proteingo: Motivation, User Experience, and Learning of Molecular Interactions in Biological Complexes. Entertainment Comput. 29, 31–42. doi:10.1016/j.entcom.2018.11.001

CrossRef Full Text | Google Scholar

Silveira, Sde. A., de Melo-Minardi, R. C., da Silveira, C. H., Santoro, M. M., and Meira, W. (2014). ENZYMAP: Exploiting Protein Annotation for Modeling and Predicting EC Number Changes in UniProt/Swiss-Prot. PLoS ONE 9, e89162. doi:10.1371/journal.pone.0089162

PubMed Abstract | CrossRef Full Text | Google Scholar

Sobolev, V., Sorokine, A., Prilusky, J., Abola, E. E., and Edelman, M. (1999). Automated Analysis of Interatomic Contacts in Proteins. Bioinformatics 15, 327–332. doi:10.1093/bioinformatics/15.4.327

PubMed Abstract | CrossRef Full Text | Google Scholar

Tamaki, F. K., Textor, L. C., Polikarpov, I., and Marana, S. R. (2014). Sets of Covariant Residues Modulate the Activity and thermal Stability of GH1 β-glucosidases. PLoS One 9, e96627. doi:10.1371/journal.pone.0096627

PubMed Abstract | CrossRef Full Text | Google Scholar

Teugjas, H., and Väljamäe, P. (2013). Selecting β-glucosidases to Support Cellulases in Cellulose Saccharification. Biotechnol. Biofuels 6, 105. doi:10.1186/1754-6834-6-105

PubMed Abstract | CrossRef Full Text | Google Scholar

Unni, S., Huang, Y., Hanson, R. M., Tobias, M., Krishnan, S., Li, W. W., et al. (2011). Web Servers and Services for Electrostatics Calculations with APBS and PDB2PQR. J. Comput. Chem. 32, 1488–1491. doi:10.1002/jcc.21720

PubMed Abstract | CrossRef Full Text | Google Scholar

Upadhyay, A. (2019). Structure of Proteins: Evolution with Unsolved Mysteries. Prog. Biophys. Mol. Biol. 149, 160–172. doi:10.1016/j.pbiomolbio.2019.04.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Veloso, C. J., Silveira, C. H., Melo, R. C., Ribeiro, C., Lopes, J. C., Santoro, M. M., et al. (2007). On the Characterization of Energy Networks of Proteins. Genet. Mol. Res. 6, 799–820.

PubMed Abstract | Google Scholar

Vojtechovský, J., Chu, K., Berendzen, J., Sweet, R. M., and Schlichting, I. (1999). Crystal Structures of Myoglobin-Ligand Complexes at Near-Atomic Resolution. Biophys. J. 77, 2153–2174. doi:10.1016/S0006-3495(99)77056-6

PubMed Abstract | CrossRef Full Text | Google Scholar

wwPDB consortium (2019). Protein Data Bank: the Single Global Archive for 3D Macromolecular Structure Data. Nucleic Acids Res. 47 (D1), D520–D528. doi:10.1093/nar/gky949

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, Y., Zhang, X., Yin, Q., Fang, W., Fang, Z., Wang, X., et al. (2015). A Mechanism of Glucose Tolerance and Stimulation of GH1 β-glucosidases. Sci. Rep. 5, 17296. doi:10.1038/srep17296

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Y., and Skolnick, J. (2005). TM-Align: A Protein Structure Alignment Algorithm Based on the TM-Score. Nucleic Acids Res. 33, 2302–2309. doi:10.1093/nar/gki524

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, P., Yang, X. L., Wang, X. G., Hu, B., Zhang, L., Zhang, W., et al. (2020). A Pneumonia Outbreak Associated with a New Coronavirus of Probable Bat Origin. Nature 579, 270–273. doi:10.1038/s41586-020-2012-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: protein interactions, structural bioinformatics, structural alignment, contacts, rational design of enzymes

Citation: Pimentel V, Mariano D, Cantão LXS, Bastos LL, Fischer P, de Lima LHF, Fassio AV and Melo-Minardi RCd (2021) VTR: A Web Tool for Identifying Analogous Contacts on Protein Structures and Their Complexes. Front. Bioinform. 1:730350. doi: 10.3389/fbinf.2021.730350

Received: 24 June 2021; Accepted: 27 July 2021;
Published: 08 November 2021.

Edited by:

Masahito Ohue, Tokyo Institute of Technology, Japan

Reviewed by:

Sebastian Kmiecik, University of Warsaw, Poland
Joan Segura Mora, University of California, San Diego, United States

Copyright © 2021 Pimentel, Mariano, Cantão, Bastos, Fischer, de Lima, Fassio and Melo-Minardi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Raquel Cardoso de Melo-Minardi, raquelcm@dcc.ufmg.br

^†These authors have contributed equally to this work

ORIGINAL RESEARCH article

VTR: A Web Tool for Identifying Analogous Contacts on Protein Structures and Their Complexes

Introduction

Materials and Methods

Structural Superposition

Contact Computation

Search for Contact Matches

Web-Based Tool

Case Studies Details

Results and Discussion

VTR Web Tool Workflow

Case Studies

Case Study 1: Myoglobin and Hemoglobin

Case Study 2: Analyzing Interactions Between the Spike Protein RBD of SARS-CoV-2 and the Cell Receptor ACE2

Case Study 3 - Glucose-Tolerant β-glucosidases

PBSA Points Substantial Electrostatic Differences

Vibrational Modulation of Loop A

Implications for Protein Engineering

Conclusion

Data Availability Statement

Author Contributions

Funding

Conflict of Interest

Publisher’s Note

Acknowledgments

Supplementary Material

References

This article is part of the Research Topic

People also looked at