Abstract
Disulfide bridges establish a fundamental element in the molecular architecture of proteins and peptides which are involved e.g., in basic biological processes or acting as toxins. NMR spectroscopy is one method to characterize the structure of bioactive compounds including cystine-containing molecules. Although the disulfide bridge itself is invisible in NMR, constraints obtained via the neighboring NMR-active nuclei allow to define the underlying conformation and thereby to resolve their functional background. In this mini-review we present shortly the impact of cysteine and disulfide bonds in the proteasome from different domains of life and give a condensed overview of recent NMR applications for the characterization of disulfide-bond containing biomolecules including advantages and limitations of the different approaches.
Introduction
Disulfide bridges formed between cysteine residues in peptides and proteins are fundamental building blocks for the molecular architecture and, thus, can govern basic biological processes. The formation of a disulfide bond by two side chain Sγ atoms of spatially proximal cysteines constitutes a two-electron oxidation process leading from reduced sulfhydryl groups of cysteines (S-H) to the oxidized cystine (S-S) residue. In cellular environments, this reaction is often supported and accelerated by enzymes like thioredoxin (Mahmood et al., 2013) or protein disulfide isomerases (Lee and Lee, 2017). Disulfide bridges can be formed intramolecular, in rarer cases even between two vicinal cysteines (Carugo et al., 2003), and constitute the only natural covalent link between polypeptides strands. In addition, they might occur as an intermolecular feature, sometimes leading to increased protein aggregation. Cleavage of disulfide bonds in biomolecules may result in the collapse of the native conformation and biological function. Thus, failures in formation or processing of disulfide bonds may lead to severe disorders by the accumulation of protein aggregates, by imposing cellular stress conditions and/or by leading to cell death (Rakhit and Chakrabartty, 2006; Hetz, 2012; Xu et al., 2014; Bechtel and Weerapana, 2017). Thus, nature has evolved a multitude of proteins with specialized biological functions based on molecular architectures involving different numbers of cystines. Only a few examples are Kunitz-type trypsin inhibitors (Otting et al., 1993; Cohen et al., 2019), multi-domain Kazal-type thrombin inhibitors like rhodniin (Van De Locht et al., 1995) and dipetalin (Schlott et al., 2002), growth factors (Christinger et al., 2004; Sitar et al., 2006), defensins (Szyk et al., 2006), neuropeptides like oxytocin (Bhaskaran et al., 1992) and vasopressin (Schmidt et al., 1991) or peptidic toxins (Elnahriry et al., 2019) and cyclotides (Park et al., 2017; De Veer et al., 2019; Huang et al., 2019).
Following the application of chiroptical techniques to uncover the structural features of a disulfide bond (Beychok, 1966; Van Wart et al., 1973; Menendez-Botet and Breslow, 1975), in the 1970s NMR spectroscopy started to emerge as method for structure determination of disulfide-bridged peptides and proteins (Ludescher and Schwyzer, 1971). Meanwhile, the power of this technique is highlighted, e.g., by the fact that 121/165 of 137/182 conotoxin structures deposited in the RCSB protein data bank or the ConoServer (Kaas et al., 2012), respectively, are NMR solution structures. NMR also allows to analyze structural and dynamic aspects of transient oxidative folding processes (Szekely et al., 2018) and to reveal conformational switching processes between disordered and folded states (Fraga et al., 2017) controlled by disulfide bridge formation. In the following sections, we present a short overview of NMR applications for the characterization of disulfide-bond containing biomolecules.
Cysteine Abundance Analysis of the Proteome
To emphasize the special role of cysteines as a structure-forming or catalytic unit in the context of an evolutionary process, we present a short analysis of proteomes from different domains of life. Questions that arise are: (I) how many proteins of a proteome contain cysteines, (II) what is the average number of cysteines and disulfide bonds in a protein, (III) are there differences in the protein length or overall amino acid distribution among proteins with and without cysteines, and (IV) does the occurrence of cysteines correlate with the accumulation of other amino acids or amino acid patterns around these cysteines? In a first step, we selected different representatives from the three domains of life (Archaea: T. gammatolerans, Bacteria: E. coli and Eukaryota: A. thaliana, D. melanogaster, S. cerevisiae, O. sativa, H. sapiens) for which single defined proteome data sets are available in the UniProt database (UniProt Consortium, 2019). Except for T. gammatolerans, each selected data set is classified as a reference proteome in UniProt. Proteins in the data set are either annotated as reviewed (manually annotated) or unreviewed (full manual annotation still pending). Besides, we examined a data set that comprises all reviewed records in UniProt (referred herein as Reviewed SwissProt).
Eighty-three percent of all proteins annotated as reviewed in UniProt contain at least one cysteine and the number of cysteines accounts for 1.38% of all amino acids (a.a.) (Table 1, Figure S3). The median length of coding sequences of proteins for all reviewed entries in UniProt is 294 a.a. The cysteine-containing proteins are, on average, significantly longer (329 a.a.) compared to proteins that carry no cysteine (141 a.a.). On average 3 cysteines are present in proteins included in the SwissProt data set and 4 cysteines if only cysteine-containing proteins are considered.
Table 1
| Species (Uniprot proteome ID) | Number of proteins / proteins with Cys (percent) | Number of amino acids all / number of Cys (percent) | Median protein length all / proteins with Cys / proteins without Cys | Median Cys per protein all / proteins with Cys | Number of reviewed proteins / proteins with disulfide-bond (percent) / proteins with at least one interchain disulfide-bond | Median length of disulfide-bond containing proteins | Median disulfide-bonds (max.) |
|---|---|---|---|---|---|---|---|
| Reviewed SwissProt | 561 176 / 464 173 (83%) | 201 585 439 / 2 787 012 (1.38%) | 294 / 329 / 141 | 3 / 4 | 561 176 / 33 995 (6%) / 3 309 | 296 | 2 (166) |
| A. thaliana (UP000006548) | 27 466 / 25 852 (94%) | 11 122 644 / 207 856 (1.87%) | 347 / 361 / 125 | 6 / 6 | 15 821 / 1 145 (7%) / 42 | 250 | 3 (8) |
| D. melanogaster (UP000000803) | 13 798 / 13 018 (94%) | 7 403 990 / 142 035 (1.92%) | 395 / 412 / 150 | 7 / 7 | 3 559 / 349 (10%) / 30 | 513 | 3 (16) |
| E. coli (UP000000625) | 4 391 / 3 694 (84%) | 1 354 362 / 15 752 (1.16%) | 271 / 296 / 137 | 3 / 3 | 4 389 / 98 (2%) / 7 | 284 | 1 (4) |
| H. sapiens (UP000005640) | 20 660 / 19 979 (97%) | 11 425 374 / 263 334 (2.30%) | 410 / 421 / 125 | 9 / 9 | 20 305 / 3 591 (18%) / 334 | 362 | 2 (159) |
| O. sativa (UP000059680) | 43 603 / 40 126 (92%) | 13 382 401 / 260 236 (1.94%) | 228 / 247 / 115 | 4 / 5 | 4 046 / 283 (7%) / 19 | 275 | 1 (16) |
| S. cerevisiae (UP000002311) | 6 049 / 5 470 (90%) | 2 936 363 / 37 272 (1.27%) | 396 / 428 / 163 | 5 / 5 | 6 049 / 93 (2%) / 15 | 261 | 2 (14) |
| T. gammatolerans (UP000001488) | 2 157 / 1 286 (60%) | 636 517 / 3 603 (0.57%) | 251 / 298 / 198 | 1 / 2 | 181 / 0 (0%) / 0 | - | - |
Proteomic analysis and disulfide bonds in reviewed proteins.
It is well-known that the median protein length in Eukaryotes is significantly longer than in Prokaryotes. Among Prokaryotes, Bacteria tend to have longer proteins, on average, than Archaea (Zhang, 2000; Skovgaard et al., 2001; Brocchieri and Karlin, 2005). Concerning the median protein length, the trends presented in Table 1 confirm the results observed by others (Zhang, 2000; Skovgaard et al., 2001; Brocchieri and Karlin, 2005) on a genomic level. With only a median protein length of 228 a.a. O. sativa significantly deviates from the average protein length of other eukaryotes. The genomic protein length distribution for each selected species is given in detail in Figure S5. Figures S7, S8 depict the genomic length distribution of cysteine-containing proteins and proteins without cysteines, respectively.
For a more realistic view of the median protein length and cysteine distribution in a cell/organism, the abundance weighted protein distribution is calculated and depicted (Table S1 and Figure S6). The protein abundance database [PAXdb, (Wang et al., 2015)], provides information about the whole genome protein abundance across different organisms and tissues. With the exceptions of T. gammatolerans and S. cerevisiae the abundance weighted median protein length is shorter compared with the genomic-based median protein length. Intriguingly, the abundance weighted median number of cysteines per protein is 4 to 5 in all selected eukaryotes and is lower than on the genetic level.
The frequency of cysteines seems to increase during evolution. While in T. gammatolerans only 60% of all proteins contain at least one cysteine, in eukaryotic proteomes, 92–97% of all proteins are cysteine-containing. This observation is also reflected in the species-specific cysteine percentage proportion of all amino acids (0.57% for T. gammatolerans and 2.30% for H. sapiens, Table 1 and Figure S3). Moreover, the median number of cysteines per protein tends to increase during evolution and reaches with 9 cysteines per protein in humans a maximum. For a detailed analysis of the genomic and abundance weighted cysteine distribution see Figures S9, S10, respectively. In the reviewed SwissProt data set the SCO-spondin proteins contain the highest number of cysteins [e.g., G. gallus: 584 cysteins (UniProtKB1: Q2PC93), H. sapiens: 563 cysteines (A2VEC9)]. It has to be noted that among the selected organisms the reference proteome of D. melanogaster includes a protein with 2647 cysteines (Dumpy, isoform Q; M9PB30). In contrast, the highest density of cysteines is observed in relatively short proteins/peptides. For example, conotoxins (P85019 or P0DPL4) and thiozillins (P0C8P6, P0C8P7) reveal with 46 and 43%, respectively, the highest content of cysteines. The “Small cysteine and glycine repeat-containing proteins” (e.g., A0A286YF46) and the “Keratin-associated proteins” (e.g., Q9BYQ5) show with ~40% the highest cysteine content in H. sapiens. If the difference in the amino acid distribution of non-cysteine-containing proteins compared to cysteine-containing proteins is considered (Figure S4), it is notable that, except for T. gammatolerans, in all data sets the leucine content is decreased, and at least one basic amino acid (lysine or arginine) content is increased, respectively. It is still subject to speculation if the structural or functional role of cysteines is compensated by an increase of, e.g., basic side-chain amino acids in non-cysteine-containing proteins.
In Figures S1, S2 we present the position-dependent amino acid frequency in cysteine-containing proteins. In each protein, which carries a cysteine, the amino acid distribution at each position N- and C-terminal stepwise next to cysteine is determined and compared to the overall amino acid distribution. The normalization is achieved by calculating the distribution ratio (amino acid distribution at position n/overall amino acid distribution). A normalized occurrence (distribution ratio) >1 implies a higher amino acid frequency at this position than expected from the overall distribution. The reverse is valid for a distribution ratio <1. It becomes clear that besides cysteine, mainly aromatic amino acids are more frequent around cysteines in all selected data sets. Particularly in the H. sapiens proteome the amino acids phenylalanine, histidine, and tyrosine reveal a more frequent pattern around cysteines than expected. These findings may reflect the widespread zinc finger structural motif.
Disulfide bonds are a central structural element which stabilizes the mature proteins' 3D structure and/or exhibit physiologically relevant redox activity (Bosnjak et al., 2014). They are mostly found in secretory proteins and extracellular domains of membrane proteins. Table 1 and Figures S11, S12 compile some statistical information about reviewed proteins with disulfide bonds. In the reviewed SwissProt data set, 6% of all proteins contain at least one disulfide bridge, and the median number of disulfide bonds is 2. As already mentioned above, for the content of cysteines, the conotoxins (e.g., P0DL39, P50983) also show with ~20% the highest content of disulfide bonds for all reviewed UniProt entries.
For the selected data sets, the content of proteins with at least one intra-chain disulfide bond increase during evolution (Table 1). Eighteen percent of all reviewed human proteins bear at least one disulfide bond. The maximal number of cystins/disulfide bonds currently observed in human proteins is 159 (Prolow-density lipoprotein receptor-related protein 1; 4544 a.a. in its canonical form; Q07954). However, as this protein contains 331 cysteines, it immediately becomes clear that not all of them under the same physical conditions form intramolecular disulfide pairs. When normalized by length, the shorter WAP four-disulfide core domain protein 3 (Q8IUB2) with 231 a.a. and 16 disulfide bonds takes over the pole position with ~7 bridges per 100 amino acids. In contrast, in T. gammatolerans no disulfide bonds are known for the reviewed proteins. The observation that the cysteine content in proteins increases during evolution can't be transferred clearly to the median number of disulfide bonds. In H. sapiens the median number of disulfide bonds is 2, whereas in S. cerevisiae it is also 2, but for D. melanogaster it is 3.
NMR Spectroscopy & Prediction Techniques
Structurally, the disulfide linkage in a cystine displays a typical bond length of ~2.04 Å (Chaney and Steinrauf, 1974). The chirality of the disulfide linkage is a stereo-electronic consequence of the four free electron pairs on the two sulfur atoms. These electron pairs interact by repulsive forces with the neighboring β-carbon-containing groups, basically allowing two energetically favorable, mirror-imaged, and equally populated conformations for the C-S-S-C torsion angle (χS−S; Figures 1A,B) (Panijpan, 1977; Thornton, 1981). A newer study of 1,505 native disulfide bonds reported the average values of the χS−S torsion to be around−87° (left-handed) and +97° (right-handed) (Craig and Dombkowski, 2013). These torsion angle values are rather exceptional when compared to the other naturally occurring amino acids in peptides as those populate mainly side-chain torsions in the trans/anti (180°) or gauche (±60°) conformational range. In contrast to the redox state, no reliable prediction of the χS−S torsion angle from chemical shifts is available. However, the web-based approach “Disulfide by Design 2.0” (DbD2) (Craig and Dombkowski, 2013) allowed to correctly predict 96% of the disulfide chiralities based on an energy function reflecting the geometric characteristics found in an analysis of disulfide bonds in the PDB. Armstrong et al. (2018) recently reported about a prediction algorithm (DISH) for the two cysteine side-chain torsion angles χ2 and χ1 using a support vector machine. This approach had an overall accuracy of 81% for simultaneous prediction of both torsions and allowed to considerably reduce the spread in the protein backbone conformations in subsequent structure calculations.
Figure 1

Chiralities of the disulfide bridge with χS−S torsional angle values of −90° (A) and +90° (B). Coloring: H—cyan, C—gray, O—red, N—blue, S—yellow, free electron pairs—magenta. In (A) distance relations leading to typical cross peaks in NOESY-type spectra are marked (small arrows). Distance relations originating either from the Hα (orange) or the Hβ protons (green) are indicated only for one of the two cysteines. (C,D) Chemical shift correlation of cysteine Cβ and Hβ2(C), Hβ3(D), respectively. Chemical shift data and correlations are obtained and visualized from the Biological Magnetic Resonance Data Bank (BMRB) using a modified PyBMRB python module. Distribution values which are outside 10 times the standard deviation were removed from each correlation data set. Contour levels reflect the total number of correlations within.
With the advent of heteronuclear NMR techniques, analyses of the 13C chemical shift values of oxidized (S-S) or reduced (S-H) cysteines became available. Based on the 13Cα and 13Cβ chemical shift data it could be deduced that the redox state is reflected in a distinct chemical shift pattern leading to two mainly non-overlapping areas for Cβ-shifts. These findings allowed the authors to suggest the following basic rule: “If the Cβ shift is <32.0 ppm or >35.0 ppm, the redox state is assigned to reduced or oxidized, respectively” (Sharma and Rajarathnam, 2000). This empirical analysis was later supported by results of quantum chemical calculations of cysteine chemical shifts (Martin et al., 2010), which also rendered the 13Cα chemical shift value insensitive for an assignment of the redox state. As introduced above, disulfide bridges favor two distinct chiralities. Figures 1C,D shows the chemical shift distributions of the Cβ vs. Hβ2/3 protons and Figure S13 for the other NMR active cysteine nuclei based on the actual BMRB data. It indicates that the Cβ/Cα distribution can be a supportive information for revealing the cysteines' redox state.
In addition to a pure NOE-based NMR structure determination, the measurement of residual dipolar couplings (RDC) allows to improve the resolution of 3D structures in case isotopically labeled compounds are available. Recently, the spider venom disulfide-rich peptide Ta1a was refined to a resolution of ~1.5 Å applying this approach (Ramanujam et al., 2020).
Lately, combining seleno-cysteine scanning and NMR analysis was shown to be a reliable approach for mapping disulfide bonds in cysteine-rich peptides and proteins (Denisov et al., 2019). The structurally conservative selenium substitution causes selective chemical shift changes of cysteine carbons involved in the mixed S–Se bond allowing identification by visual comparison of [1H,13C]-HSQC spectra of native and Sec-mutants.
Conotoxins & Granulins
Conotoxins, small disulfide bridge-containing peptides found in marine cone snails, have attracted considerable scientific interest as they bind to ion channels. The pharmacological potential to modulate or block the ion channel activity and their synthetic availability make conotoxins promising candidates for new analgesics. However, Heimer et al. recently showed on the example of μ-PIIIA (three disulfide bonds) the complexity of the synthesis, purification, and analytical characterization of one specific isomer in the multitude of different potentially formed disulfide-bridged isomers of those cysteine-rich peptides (Heimer et al., 2018). With respect to this, ionic liquids have proven to be a promising solvent for controlling the oxidative folding process (Miloslavina et al., 2010).
The impact of deletion of disulfide bonds on the activity of α-conotoxins (two disulfide bonds) at human neuronal nicotinic acetylcholine receptors was studied employing NMR, Molecular Dynamics simulations and voltage-clamp techniques (Tabassum et al., 2017). The data supports the notion that the two disulfide bonds have been selectively conserved to create and stabilize a structural scaffold optimized for receptor binding.
Two recent publications presented structural relatedness between conotoxin structures and the granulin module, which was also solved by NMR and typically contains six disulfide bridges (Hrabal et al., 1996). For conotoxin Φ-MiXXVIIA a novel cysteine framework mimicking granulin and displaying anti-apoptotic activity was observed (Jin et al., 2017). Also for the conotoxin NextH-Vc7.2 with three disulfide bridges the NMR structure determination revealed a granulin-like fold arising from the common inhibitor cystine knot framework (Nielsen et al., 2019). Based on further occurrences of this motif, e.g., in αD-GeXXa conotoxin, the authors conclude that the fold comprising two short, stacked β-hairpins stabilized by two parallel disulfide bonds might be an autonomous folding unit.
Kazal-, Kunitz-, and Defensin-Type Folds
From earlier studies it is known that protease inhibitors, e.g., the thrombin inhibitors rhodniin (Van De Locht et al., 1995) and dipetalin (Icke et al., 2002), are composed of (repetitive) Kazal-type domains structurally shaped by three disulfide bridges. Recently, the NMR structure of CmPI-II, an inhibitor of trypsin, human neutrophil elastase, and subtilisin A, was elucidated and the complex with the latter modeled (Cabrera-Munoz et al., 2019). Similar to the recent structural study on SPINK6 (Jung et al., 2016) from the serine protease inhibitors of Kazal-type family (SPINK) (Feng et al., 2012), the authors describe a flexible N-terminal region and attribute the P2 site potential for alternative interactions in the complex formation.
Kunitz-type proteins, with bovine basic pancreatic trypsin inhibitor (BPTI) as the most extensively studied member (Berndt et al., 1992), display a compact conformation stabilized by three strongly conserved intra-chain disulfide bonds. Recently, (Banijamali et al., 2019) presented an NMR characterization of Pseudocerastes persicus trypsin inhibitor (PPTI) sharing structural similarities with dendrotoxins. By successive modeling, they could show that PPTI might block Kv1.1 potassium channels with the same mechanism as dendrotoxins. Also, Ixolaris, a potent tick salivary anticoagulant binding the coagulation factor Xa and the zymogen FX, shows a canonical Kunitz 3D structure (De Paula et al., 2019). However, the NMR and modeling results indicate that it exhibits a non-canonical inhibition interaction outside the active site of FX.
The pacifastin family of serine protease inhibitors found in animals and plants comprises short proteins exhibiting three β-strands which are again stabilized by three disulfide bridges (Simonet et al., 2002; Gaspari et al., 2004). These structural features can induce a stable, compact core and an extended binding loop.
Another peptide class displaying three disulfide linkages are defensins. Depending on the spacing of the cysteines and their pairing, three subfamilies (α, β, θ) are defined. Molecules of these classes share a similar structural fold (Lehrer and Lu, 2012; Dias Rde and Franco, 2015) and are facing interest as promising alternatives to conventional antibiotics. Recently, the NMR solution structure of rattusin expanded the structural repertoire of defensins by a scaffold formed by intermolecular disulfide exchanges between dimer units (Min et al., 2017).
Kinases and Phosphatases
The C-terminal Src kinase (Csk) is a member of the CSK family of protein tyrosine kinases, which contains an SH2 domain carrying a unique disulfide bond which regulates the Csk kinase activity (Mills et al., 2007). The kinase activity of Csk was found to be strongly reduced upon the SH2 disulfide bond formation. Liu and Cowburn (2016) observed from X-ray data that only minor structural changes in the SH2 domain resulted from the disulfide bond formation. However, NMR measurements indicated that the reduced SH2 could bind slightly more efficiently with a Csk-binding protein-phosphorylated peptide.
Fms-like tyrosine kinase 3 is a member of the PDGFR (class III RTK) family containing disulfide bridges as well as free cysteines. By serine replacement of cytoplasmic cysteines evidence was found that oxidative modification of cysteine residues, e.g., by exogenous ROS, regulates the kinase activity of this clinically important oncoprotein (Bohmer et al., 2019).
For the closely-related members of the KIM-family protein-tyrosine phosphatases (PTP) (Machado et al., 2017) found significant differences in oxidation profiles coming along with different stabilization mechanisms. Whereas, striatal-enriched PTP and PTP-receptor type R stabilize their reversibly oxidized state by forming an intramolecular disulfide bond, in hematopoietic PTP the unexpected formation of a reversible intermolecular disulfide bond was observed.
Concluding Remarks
The cited examples illustrate that cysteine disulfide bridging is an essential and highly evolved natural feature for the stabilization of peptide and protein structures and for modulation of biological activities. This finding is underlined by the extraordinary distributions of cysteines found in the proteomic data of different species/kingdoms. Current NMR and X-ray techniques allow defining the molecular structures of disulfide-rich biomolecules in high resolution. As disulfide bridges constitute the only natural covalent link between polypeptides strands, the acquired knowledge on their contribution to molecular scaffolding supports engineering of new cystine-based compounds with new functional (Nagarajan et al., 2018) or dynamical features (Gutmans et al., 2019), enhanced stability (Dombkowski et al., 2014), ultimately, aiming at improved pharmaco-kinetic and -dynamic properties for new therapies and treatment approaches. However, disulfide bonds tend to be unstable under reducing conditions, i.e., in many physiological situations, which triggered search for therapeutic compounds to make use of chemical modifications to stably replace these bonds. Thus, stable, non-reducible dicarba-bridged analogs were reported e.g., for oxytocin (Stymiest et al., 2003), for α-conotoxins of subtypes α-ImI, Vc1.1 and RgIA (MacRaild et al., 2009; Van Lierop et al., 2013; Chhabra et al., 2014) or, recently, insulin (Van Lierop et al., 2017).
Statements
Author contributions
CW, AK, AL, and OO equally contributed to the preparation of the manuscript. OO approved the final version.
Acknowledgments
We acknowledge the financial support of the Martin Luther University Halle-Wittenberg within the funding program Open Access Publishing by the German Research Foundation (DFG).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fchem.2020.00280/full#supplementary-material
- NMR
nuclear magnetic resonance
- a. a.
amino acids.
Abbreviations
Footnotes
1.^In the following text the UniProtKB codes are referenced in brackets.
References
1
ArmstrongD. A.KaasQ.RosengrenK. J. (2018). Prediction of disulfide dihedral angles using chemical shifts. Chem. Sci.9, 6548–6556. 10.1039/C8SC01423J
2
BanijamaliS. E.AmininasabM.ZaeifiD. (2019). Structural characterization of PPTI, a kunitz-type protein from the venom of Pseudocerastes persicus. PLoS ONE14:e0214657. 10.1371/journal.pone.0214657
3
BechtelT. J.WeerapanaE. (2017). From structure to redox: the diverse functional roles of disulfides and implications in disease. Proteomics.17:1600391. 10.1002/pmic.201600391
4
BerndtK. D.GuntertP.OrbonsL. P.WuthrichK. (1992). Determination of a high-quality nuclear magnetic resonance solution structure of the bovine pancreatic trypsin inhibitor and comparison with three crystal structures. J. Mol. Biol.227, 757–775. 10.1016/0022-2836(92)90222-6
5
BeychokS. (1966). Circular dichroism of biological macromolecules. Science154, 1288–1299. 10.1126/science.154.3754.1288
6
BhaskaranR.ChuangL. C.YuC. (1992). Conformational properties of oxytocin in dimethyl sulfoxide solution: NMR and restrained molecular dynamics studies. Biopolymers32, 1599–1608. 10.1002/bip.360321203
7
BohmerA.BarzS.SchwabK.KolbeU.GabelA.KirkpatrickJ.et al. (2019). Modulation of FLT3 signal transduction through cytoplasmic cysteine residues indicates the potential for redox regulation. Redox Biol.28:101325. 10.1016/j.redox.2019.101325
8
BosnjakI.BojovicV.Segvic-BubicT.BielenA. (2014). Occurrence of protein disulfide bonds in different domains of life: a comparison of proteins from the Protein Data Bank. Protein Eng. Des. Sel.27, 65–72. 10.1093/protein/gzt063
9
BrocchieriL.KarlinS. (2005). Protein length in eukaryotic and prokaryotic proteomes. Nucleic Acids Res.33, 3390–3400. 10.1093/nar/gki615
10
Cabrera-MunozA.ValienteP. A.RojasL.Alonso-Del-Rivero AntiguaM.PiresJ. R. (2019). NMR structure of CmPI-II, a non-classical Kazal protease inhibitor: understanding its conformational dynamics and subtilisin A inhibition. J. Struct. Biol.206, 280–294. 10.1016/j.jsb.2019.03.011
11
CarugoO.CemazarM.ZaharievS.HudakyI.GaspariZ.PerczelA.et al. (2003). Vicinal disulfide turns. Protein Eng.16, 637–639. 10.1093/protein/gzg088
12
ChaneyM. O.SteinraufL. K. (1974). The crystal and molecular structure of tetragonal l-cystine. Acta Crystallographica Section B30, 711–716. 10.1107/S0567740874003566
13
ChhabraS.BelgiA.BartelsP.Van LieropB. J.RobinsonS. D.KompellaS. N.et al. (2014). Dicarba analogues of alpha-conotoxin RgIA. Structure, stability, and activity at potential pain targets. J. Med. Chem.57, 9933–9944. 10.1021/jm501126u
14
ChristingerH. W.FuhG.De VosA. M.WiesmannC. (2004). The crystal structure of placental growth factor in complex with domain 2 of vascular endothelial growth factor receptor-1. J. Biol. Chem.279, 10382–10388. 10.1074/jbc.M313237200
15
CohenI.CobanM.ShaharA.SankaranB.HocklaA.LachamS.et al. (2019). Disulfide engineering of human Kunitz-type serine protease inhibitors enhances proteolytic stability and target affinity toward mesotrypsin. J. Biol. Chem.294, 5105–5120. 10.1074/jbc.RA118.007292
16
CraigD. B.DombkowskiA. A. (2013). Disulfide by design 2.0: a web-based tool for disulfide engineering in proteins. BMC Bioinformatics14:346. 10.1186/1471-2105-14-346
17
De PaulaV. S.SgourakisN. G.FrancischettiI. M. B.AlmeidaF. C. L.MonteiroR. Q.ValenteA. P. (2019). NMR structure determination of Ixolaris and factor X(a) interaction reveals a noncanonical mechanism of Kunitz inhibition. Blood134, 699–708. 10.1182/blood.2018889493
18
De VeerS. J.KanM. W.CraikD. J. (2019). Cyclotides: from structure to function. Chem. Rev.119, 12375–12421. 10.1021/acs.chemrev.9b00402
19
DenisovS. S.IppelJ. H.MansB. J.DijkgraafI.HackengT. M. (2019). SecScan: a general approach for mapping disulfide bonds in synthetic and recombinant peptides and proteins. Chem. Commun.55, 1374–1377. 10.1039/C8CC08777F
20
Dias RdeO.FrancoO. L. (2015). Cysteine-stabilized alphabeta defensins: From a common fold to antibacterial activity. Peptides72, 64–72. 10.1016/j.peptides.2015.04.017
21
DombkowskiA. A.SultanaK. Z.CraigD. B. (2014). Protein disulfide engineering. FEBS Lett.588, 206–212. 10.1016/j.febslet.2013.11.024
22
ElnahriryK. A.WaiD. C. C.KrishnarjunaB.BadawyN. N.ChittoorB.MacraildC. A.et al. (2019). Structural and functional characterisation of a novel peptide from the Australian sea anemone Actinia tenebrosa. Toxicon168, 104–112. 10.1016/j.toxicon.2019.07.002
23
FengY.GengY.ZhouT.WangJ. (2012). NMR structure note: human esophageal cancer-related gene 2. J. Biomol. NMR53, 65–70. 10.1007/s10858-012-9622-9
24
FragaH.PujolsJ.Gil-GarciaM.RoqueA.Bernardo-SeisdedosG.SantambrogioC.et al. (2017). Disulfide driven folding for a conditionally disordered protein. Sci. Rep.7:16994. 10.1038/s41598-017-17259-4
25
GaspariZ.OrtutayC.PerczelA. (2004). A simple fold with variations: the pacifastin inhibitor family. Bioinformatics20, 448–451. 10.1093/bioinformatics/btg451
26
GutmansD. S.WhittakerS. B.AsianiK.AtkinsonR. A.OregioniA.PfuhlM. (2019). Controlling the dynamics of the Nek2 leucine zipper by engineering of “kinetic” disulphide bonds. PLoS ONE14:e0210352. 10.1371/journal.pone.0210352
27
HeimerP.TietzeA. A.BaumlC. A.ResemannA.MayerF. J.SuckauD.et al. (2018). Conformational mu-Conotoxin PIIIA isomers revisited: impact of cysteine pairing on disulfide-bond assignment and structure elucidation. Anal. Chem.90, 3321–3327. 10.1021/acs.analchem.7b04854
28
HetzC. (2012). The unfolded protein response: controlling cell fate decisions under ER stress and beyond. Nat. Rev. Mol. Cell Biol.13, 89–102. 10.1038/nrm3270
29
HrabalR.ChenZ.JamesS.BennettH. P.NiF. (1996). The hairpin stack fold, a novel protein architecture for a new family of protein growth factors. Nat. Struct. Biol.3, 747–752. 10.1038/nsb0996-747
30
HuangY. H.DuQ.CraikD. J. (2019). Cyclotides: disulfide-rich peptide toxins in plants. Toxicon172, 33–44. 10.1016/j.toxicon.2019.10.244
31
IckeC.SchlottB.OhlenschlagerO.HartmannM.GuhrsK. H.GlusaE. (2002). Fusion proteins with anticoagulant and fibrinolytic properties: functional studies and structural considerations. Mol. Pharmacol.62, 203–209. 10.1124/mol.62.2.203
32
JinA. H.DekanZ.SmoutM. J.WilsonD.DutertreS.VetterI.et al. (2017). Conotoxin Phi-MiXXVIIA from the superfamily G2 employs a novel cysteine framework that mimics granulin and displays anti-apoptotic activity. Angew. Chem. Int. Ed. Engl.56, 14973–14976. 10.1002/anie.201708927
33
JungS.FischerJ.SpudyB.KerkowT.SonnichsenF. D.XueL.et al. (2016). The solution structure of the kallikrein-related peptidases inhibitor SPINK6. Biochem. Biophys. Res. Commun.471, 103–108. 10.1016/j.bbrc.2016.01.172
34
KaasQ.YuR.JinA. H.DutertreS.CraikD. J. (2012). ConoServer: updated content, knowledge, and discovery tools in the conopeptide database. Nucleic Acids Res.40, D325–D330. 10.1093/nar/gkr886
35
LeeE.LeeD. H. (2017). Emerging roles of protein disulfide isomerase in cancer. BMB Rep.50, 401–410. 10.5483/BMBRep.2017.50.8.107
36
LehrerR. I.LuW. (2012). alpha-Defensins in human innate immunity. Immunol. Rev.245, 84–112. 10.1111/j.1600-065X.2011.01082.x
37
LiuD.CowburnD. (2016). Combining biophysical methods to analyze the disulfide bond in SH2 domain of C-terminal Src kinase. Biophys. Rep.2, 33–43. 10.1007/s41048-016-0025-4
38
LudescherU.SchwyzerR. (1971). On the chirality of the cystine disulfide group: assignment of helical sense in a model compound with a dihedral angel greater than ninety degrees using NMR. and CD. Helv. Chim. Acta54, 1637–1644. 10.1002/hlca.19710540615
39
MachadoL.ShenT. L.PageR.PetiW. (2017). The KIM-family protein-tyrosine phosphatases use distinct reversible oxidation intermediates: Intramolecular or intermolecular disulfide bond formation. J. Biol. Chem.292, 8786–8796. 10.1074/jbc.M116.774174
40
MacRaildC. A.IllesingheJ.Van LieropB. J.TownsendA. L.ChebibM.LivettB. G.et al. (2009). Structure and activity of (2,8)-dicarba-(3,12)-cystino alpha-ImI, an alpha-conotoxin containing a nonreducible cystine analogue. J. Med. Chem.52, 755–762. 10.1021/jm8011504
41
MahmoodD. F.AbderrazakA.El HadriK.SimmetT.RouisM. (2013). The thioredoxin system as a therapeutic target in human health and disease. Antioxid Redox. Signal.19, 1266–1303. 10.1089/ars.2012.4757
42
MartinO. A.VillegasM. E.VilaJ. A.ScheragaH. A. (2010). Analysis of 13Calpha and 13Cbeta chemical shifts of cysteine and cystine residues in proteins: a quantum chemical approach. J. Biomol. NMR46, 217–225. 10.1007/s10858-010-9396-x
43
Menendez-BotetC. J.BreslowE. (1975). Chemical and physical properties of the disulfides of bovine neurophysin-II. Biochemistry14, 3825–3835. 10.1021/bi00688a015
44
MillsJ. E.WhitfordP. C.ShafferJ.OnuchicJ. N.AdamsJ. A.JenningsP. A. (2007). A novel disulfide bond in the SH2 Domain of the C-terminal Src kinase controls catalytic activity. J. Mol. Biol.365, 1460–1468. 10.1016/j.jmb.2006.10.076
45
MiloslavinaA.EbertC.TietzeD.OhlenschlagerO.EnglertC.GorlachM.et al. (2010). An unusual peptide from Conus villepinii: synthesis, solution structure, and cardioactivity. Peptides31, 1292–1300. 10.1016/j.peptides.2010.04.002
46
MinH. J.YunH.JiS.RajasekaranG.KimJ. I.KimJ. S.et al. (2017). Rattusin structure reveals a novel defensin scaffold formed by intermolecular disulfide exchanges. Sci. Rep.7:45282. 10.1038/srep45282
47
NagarajanD.SukumaranS.DekaG.KrishnamurthyK.AtreyaH. S.ChandraN. (2018). Design of a heme-binding peptide motif adopting a beta-hairpin conformation. J. Biol. Chem.293, 9412–9422. 10.1074/jbc.RA118.001768
48
NielsenL. D.FogedM. M.AlbertA.BertelsenA. B.SoltoftC. L.RobinsonS. D.et al. (2019). The three-dimensional structure of an H-superfamily conotoxin reveals a granulin fold arising from a common ICK cysteine framework. J. Biol. Chem.294, 8745–8759. 10.1074/jbc.RA119.007491
49
OttingG.LiepinshE.WuthrichK. (1993). Disulfide bond isomerization in BPTI and BPTI(G36S): an NMR study of correlated mobility in proteins. Biochemistry32, 3571–3582. 10.1021/bi00065a008
50
PanijpanB. (1977). Chirality of the disulfide bond in biomolecules. J. Chem. Educ.54, 670–672. 10.1021/ed054p670
51
ParkS.YooK. O.MarcussenT.BacklundA.JacobssonE.RosengrenK. J.et al. (2017). Cyclotide evolution: insights from the analyses of their precursor sequences, structures and distribution in violets (Viola). Front. Plant Sci.8:2058. 10.3389/fpls.2017.02058
52
RakhitR.ChakrabarttyA. (2006). Structure, folding, and misfolding of Cu,Zn superoxide dismutase in amyotrophic lateral sclerosis. Biochim. Biophys. Acta1762, 1025–1037. 10.1016/j.bbadis.2006.05.004
53
RamanujamV.ShenY.YingJ.MobliM. (2020). Residual dipolar couplings for resolving cysteine bridges in disulfide-rich peptides. Front. Chem.7:889. 10.3389/fchem.2019.00889
54
SchlottB.WohnertJ.IckeC.HartmannM.RamachandranR.GuhrsK. H.et al. (2002). Interaction of Kazal-type inhibitor domains with serine proteinases: biochemical and structural studies. J. Mol. Biol.318, 533–546. 10.1016/S0022-2836(02)00014-1
55
SchmidtJ. M.OhlenschlagerO.RuterjansH.GrzonkaZ.KojroE.PavoI.et al. (1991). Conformation of [8-arginine]vasopressin and V1 antagonists in dimethyl sulfoxide solution derived from two-dimensional NMR spectroscopy and molecular dynamics simulation. Eur. J. Biochem.201, 355–371. 10.1111/j.1432-1033.1991.tb16293.x
56
SharmaD.RajarathnamK. (2000). 13C NMR chemical shifts can predict disulfide bond formation. J. Biomol. NMR18, 165–171. 10.1023/A:1008398416292
57
SimonetG.ClaeysI.BroeckJ. V. (2002). Structural and functional properties of a novel serine protease inhibiting peptide family in arthropods. Comp. Biochem. Physiol. B Biochem. Mol. Biol.132, 247–255. 10.1016/S1096-4959(01)00530-9
58
SitarT.PopowiczG. M.SiwanowiczI.HuberR.HolakT. A. (2006). Structural basis for the inhibition of insulin-like growth factors by insulin-like growth factor-binding proteins. Proc. Natl. Acad. Sci. U.S.A.103, 13028–13033. 10.1073/pnas.0605652103
59
SkovgaardM.JensenL. J.BrunakS.UsseryD.KroghA. (2001). On the total number of genes and their length distribution in complete microbial genomes. Trends Genet.17, 425–428. 10.1016/S0168-9525(01)02372-1
60
StymiestJ. L.MitchellB. F.WongS.VederasJ. C. (2003). Synthesis of biologically active dicarba analogues of the peptide hormone oxytocin using ring-closing metathesis. Org. Lett.5, 47–49. 10.1021/ol027160v
61
SzekelyO.ArmonyG.OlsenG. L.BigmanL. S.LevyY.FassD.et al. (2018). Identification and rationalization of kinetic folding intermediates for a low-density lipoprotein receptor ligand-binding module. Biochemistry57, 4776–4787. 10.1021/acs.biochem.8b00466
62
SzykA.WuZ.TuckerK.YangD.LuW.LubkowskiJ. (2006). Crystal structures of human alpha-defensins HNP4, HD5, and HD6. Protein Sci.15, 2749–2760. 10.1110/ps.062336606
63
TabassumN.TaeH. S.JiaX.KaasQ.JiangT.AdamsD. J.et al. (2017). Role of CysI-CysIII disulfide bond on the structure and activity of alpha-conotoxins at human neuronal nicotinic acetylcholine receptors. ACS Omega2, 4621–4631. 10.1021/acsomega.7b00639
64
ThorntonJ. M. (1981). Disulphide bridges in globular proteins. J. Mol. Biol.151, 261–287. 10.1016/0022-2836(81)90515-5
65
UniProt Consortium (2019). UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res.47, D506–D515. 10.1093/nar/gky1049
66
Van De LochtA.LambaD.BauerM.HuberR.FriedrichT.KrogerB.et al. (1995). Two heads are better than one: crystal structure of the insect derived double domain Kazal inhibitor rhodniin in complex with thrombin. EMBO J.14, 5149–5157. 10.1002/j.1460-2075.1995.tb00199.x
67
Van LieropB.OngS. C.BelgiA.DelaineC.AndrikopoulosS.HaworthN. L.et al. (2017). Insulin in motion: The A6-A11 disulfide bond allosterically modulates structural transitions required for insulin activity. Sci. Rep.7:17239. 10.1038/s41598-017-16876-3
68
Van LieropB. J.RobinsonS. D.KompellaS. N.BelgiA.McarthurJ. R.HungA.et al. (2013). Dicarba alpha-conotoxin Vc1.1 analogues with differential selectivity for nicotinic acetylcholine and GABAB receptors. ACS Chem. Biol.8, 1815–1821. 10.1021/cb4002393
69
Van WartH. E.LewisA.ScheragaH. A.SaevaF. D. (1973). Disulfide bond dihedral angles from Raman spectroscopy. Proc. Natl. Acad. Sci. U.S.A.70, 2619–2623. 10.1073/pnas.70.9.2619
70
WangM.HerrmannC. J.SimonovicM.SzklarczykD.Von MeringC. (2015). Version 4.0 of PaxDb: Protein abundance data, integrated across model organisms, tissues, and cell-lines. Proteomics15, 3163–3168. 10.1002/pmic.201400441
71
XuS.SankarS.NeamatiN. (2014). Protein disulfide isomerase: a promising target for cancer therapy. Drug Discov. Today19, 222–240. 10.1016/j.drudis.2013.10.017
72
ZhangJ. (2000). Protein-length distributions for the three domains of life. Trends Genet.16, 107–109. 10.1016/S0168-9525(99)01922-8
Summary
Keywords
disulfide bridge, cystine, protein, peptide, NMR, spectroscopy
Citation
Wiedemann C, Kumar A, Lang A and Ohlenschläger O (2020) Cysteines and Disulfide Bonds as Structure-Forming Units: Insights From Different Domains of Life and the Potential for Characterization by NMR. Front. Chem. 8:280. doi: 10.3389/fchem.2020.00280
Received
04 November 2019
Accepted
23 March 2020
Published
23 April 2020
Volume
8 - 2020
Edited by
Durba Roy, Birla Institute of Technology and Science, India
Reviewed by
Diego Brancaccio, University of Naples Federico II, Italy; Mateus Webba Da Silva, Ulster University, United Kingdom
Updates
Copyright
© 2020 Wiedemann, Kumar, Lang and Ohlenschläger.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Christoph Wiedemann christoph.wiedemann@biochemtech.uni-halle.deOliver Ohlenschläger oliver.ohlenschlaeger@leibniz-fli.de
This article was submitted to Medicinal and Pharmaceutical Chemistry, a section of the journal Frontiers in Chemistry
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.