Identification of multiple physicochemical and structural properties associated with soluble expression of eukaryotic proteins in cell-free bacterial extracts

Tokmakov, Alexander A.

doi:10.3389/fmicb.2014.00295

METHODS article

Front. Microbiol., 20 June 2014

Sec. Physiology and Metabolism of Microorganisms

Volume 5 - 2014 | https://doi.org/10.3389/fmicb.2014.00295

Alexander A. Tokmakov ^*

Research Center for Environmental Genomics, Kobe University, Kobe, Japan

Abstract

Bacterial extracts are widely used to synthesize recombinant proteins. Vast volumes of data have accumulated in cell-free expression databases, covering a wide range of existing proteins. This makes possible a comprehensive bioinformatics analysis and the identification of multiple features associated with protein solubility and aggregation. This paper presents an approach to identify the multiple physicochemical and structural properties of amino acid sequences associated with soluble expression of eukaryotic proteins in cell-free bacterial extracts. The method includes: (1) categorical assessment of expression data; (2) calculation and prediction of multiple properties of expressed sequences; (3) correlation of the individual properties with the expression scores; and (4) evaluation of statistical significance of the observed correlations. Using this method, a number of significant correlations between calculated and predicted properties of amino acid sequences and their propensity for soluble cell-free expression have been revealed.

INTRODUCTION

Heterologous protein synthesis is widely used for production of recombinant proteins. In particular, eukaryotic proteins and their domains are often expressed in bacterial hosts (Yokoyama, 2003; Sorensen and Mortensen, 2005; Sivashanmugam et al., 2009; Chen, 2012). However, only a minor fraction of all proteins can be successfully produced in bacterial host systems. At present, the factors determining expression success in these systems are poorly understood. Various physicochemical features of an amino acid sequence have been implicated as determining factors of soluble protein expression in bacteria (Bertone et al., 2001; Dyson et al., 2004; Goh et al., 2004; Idicula-Thomas and Balaji, 2005).

Recently, cell-free systems of protein synthesis have been developed that offer numerous advantages over cell-based expression (reviewed in Spirin, 2004; Katzen et al., 2005; He, 2008). Cell-free systems allow genome-scale expression of various amino acid sequences under strictly controlled uniform conditions. The productivity of bacterial cell-free synthesis reaches several milligrams of protein per milliliter of reaction mixture (Kigawa et al., 1999). Most often, the purpose of heterologous cell-free synthesis is to produce a properly folded and functionally active protein product in amounts sufficient for structural and functional studies. However, the folding of eukaryotic proteins is greatly compromised in bacterial extracts due to intrinsic differences between the cytoplasmic environments of prokaryotic and eukaryotic cells. Moreover, many eukaryotic proteins require multiple post-translational modifications (PTMs) to attain a native, biologically active state, whereas bacterial expression systems have only a limited capacity for PTMs.

In the present paper, we describe an approach aimed at identification of numerous physicochemical, structural and functional properties of amino acid sequences, including the sites of multiple PTMs, associated with soluble expression of eukaryotic proteins in bacterial cell-free extracts, and highlight major correlations obtained using this approach.

METHOD

METHOD OVERVIEW

The developed method is intended for analysis of output from an existing cell-free protein production pipeline. Thus, this paper does not cover the experimental workflow of protein production, which is described in detail in previous publications (Yabuki et al., 2007; Kigawa et al., 2008; Kurotani et al., 2010; Tokmakov et al., 2012). Here, the focus is on processing experimental data in order to identify multiple physicochemical and structural properties associated with soluble expression of eukaryotic proteins in cell-free bacterial extracts. It is important for the developed approach that all the proteins in the analyzed dataset are expressed under the same uniform set of conditions. This minimizes the influence of sequence-independent factors and makes possible adequate categorical assessment of expression data (see Categorical Assessment of Expression Data section). Affinity purification tags should be avoided in the expressed sequences because they hinder the analysis of expression correlations by decreasing the role of sequence-specific determinants.

The main steps of the proposed method are summarized in Figure 1. They include: (1) categorical assessment of the experimental results of protein expression; (2) determination of multiple physicochemical and structural properties of the expressed amino acid sequences using computational and predictive bioinformatics tools; (3) correlation of the individual protein properties with the experimental expression scores; and (4) evaluation of statistical significance of the observed correlations. The developed approach has been extensively used to analyze experimental expression of human proteins and their domains in Escherichia coli bacterial extracts (Kurotani et al., 2010; Tokmakov et al., 2012; see Results and Discussion section). However, it can be universally applied to any other cell-free system of heterologous protein synthesis. Each step of the above protocol is detailed below.

FIGURE 1

CATEGORICAL ASSESSMENT OF EXPRESSION DATA

At the stage of expression assessment, all studied proteins are classified into three mutually exclusive categories – soluble (A), insoluble (C), and non-expressed (N) proteins (Figure 2). Each sequence is placed into exactly one expression category. Soluble and insoluble products of the protein synthesis reaction can be separated by centrifugation at 10,000 × g for 10 min and visualized by Coomassie Blue staining after SDS PAGE. The scores A, C, and N are assigned as follows: A, soluble proteins expressed at a level of more than 0.1 mg per ml of cell-free extract; C, expressed, but insoluble proteins; and N, non-expressed proteins with an expression level below 0.1 mg/ml. Protein products expressed at a level below 0.1 mg/ml are difficult to visualize on Coomassie-stained gels, because the specific protein bands are masked by the endogenous proteins of the bacterial extract. Proteins that are expressed at a lower than expected molecular size should be classified into category N, as they cannot attain proper structure and function. Notably, in this setting, the score A provides an upper estimate of soluble protein expression, because centrifugation at 10,000 × g cannot discriminate between small protein aggregates and truly soluble proteins. Often, expressed proteins can be found in both the soluble and insoluble fractions of the bacterial extract. Lane-to-lane comparison of the total and supernatant fractions of the extract in PAGE gels is usually sufficient to establish the preferential pattern of protein expression.
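The scoring rules above can be sketched in a few lines of Python. This is only an illustration, not the pipeline's actual code: the `ExpressionResult` record, its field names, and the rule that a protein counts as preferentially soluble when at least half of the product is in the supernatant are assumptions introduced here. Only the 0.1 mg/ml threshold and the handling of truncated products come from the text.

```python
from dataclasses import dataclass

# Detection threshold stated in the text (mg of protein per ml of extract).
EXPRESSION_THRESHOLD_MG_PER_ML = 0.1


@dataclass
class ExpressionResult:
    """Hypothetical record of one expression experiment (names are assumptions)."""
    total_mg_per_ml: float      # yield estimated in the total reaction mixture
    soluble_mg_per_ml: float    # yield estimated in the 10,000 x g supernatant
    expected_size: bool = True  # product migrates at the expected molecular size


def expression_score(result: ExpressionResult) -> str:
    """Return 'A' (soluble), 'C' (insoluble) or 'N' (non-expressed)."""
    # Products smaller than expected are scored as non-expressed.
    if not result.expected_size:
        return "N"
    # Below the threshold the band cannot be seen on a Coomassie-stained gel.
    if result.total_mg_per_ml < EXPRESSION_THRESHOLD_MG_PER_ML:
        return "N"
    # Assumption: "preferentially soluble" means at least half of the product
    # is in the supernatant, and the soluble yield itself exceeds the threshold.
    mostly_soluble = result.soluble_mg_per_ml >= 0.5 * result.total_mg_per_ml
    if mostly_soluble and result.soluble_mg_per_ml > EXPRESSION_THRESHOLD_MG_PER_ML:
        return "A"
    return "C"
```

In practice the yields are read from gels by lane-to-lane comparison rather than measured exactly, so the numeric inputs here stand in for a visual judgment.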

FIGURE 2

CALCULATION AND PREDICTION OF MULTIPLE PROPERTIES OF EXPRESSED SEQUENCES

In this step, multiple features of the amino acid sequences in the expression dataset are calculated or predicted using existing bioinformatics tools. The protein properties can be classified into four major types: physicochemical parameters, structural properties, the presence of specific sequence motifs, and the presence of PTM sites (Figure 3). Many of the physicochemical parameters, such as protein length, molecular weight, amino acid composition, number of charged residues, pI, hydrophobicity, etc., can be calculated using the free ProtParam tool available at the Expasy server¹. On the other hand, it is difficult to precisely calculate high-dimensional protein properties, because the 3D structures of expressed protein targets are usually unknown. Still, it is possible to deduce some structural features of the proteins in the expression dataset using existing prediction algorithms. Admittedly, some of these algorithms have quite low prediction accuracy, not exceeding 80%. The low accuracy of prediction hampers the subsequent correlation analysis, making the detection of weak correlations impossible. Solvent accessibility can be assessed with the ACCpro 4.0 software downloaded from the SCRATCH Protein Predictor server (Cheng et al., 2005²) and the content of secondary structure is evaluated with the PREDATOR 2.1.2 tool (Frishman and Argos, 1997) provided online³. Coiled coil structures are predicted with the pepcoil tool provided online⁴ (Lupas et al., 1991) and the content of disordered structure is predicted with the RONN software (Yang et al., 2005⁵). The specific sequence motifs in proteins can also be predicted using available bioinformatics tools. PEST regions, signal sequences, and transmembrane domains are predicted with the tools provided online⁶,⁷,⁸.
The sites of multiple PTMs, such as phosphorylation, glycosylation, amidation, Asx hydroxylation, sulfation, prenylation, etc., can be predicted using the PROSITE scanning tool PS_SCAN available online at http://www.hpa-bioinfotools.org.uk/cgi-bin/ps_scan/ps_scanCGI.pl. The sites of ubiquitination and SUMOylation are predicted using the site-specific predictors UbPred (Radivojac et al., 2010) and SUMOsp 2.0 (Ren et al., 2009) freely downloadable for academic research from http://ubpred.org/ and http://sumosp.biocuckoo.org/, respectively. The sites of S-palmitoylation are predicted with the CSS-Palm tool (Ren et al., 2008⁹) and S–S bonds can be predicted using the DIpro tool (Cheng et al., 2006) downloadable free from http://download.igb.uci.edu/intro.html.
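For readers who prefer to script this step, the following minimal sketch computes a few of the simpler sequence-derived properties locally. It is not a replacement for ProtParam or PS_SCAN. The function name and the returned keys are assumptions made for illustration, the hydrophobicity measure is the mean Kyte-Doolittle hydropathy (GRAVY), and the motif shown is the PROSITE N-glycosylation pattern N-{P}-[ST]-{P} (PS00001) written as a regular expression.

```python
import re
from collections import Counter

# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# PROSITE PS00001, N-{P}-[ST]-{P}; the lookahead lets overlapping sites be counted.
N_GLYCOSYLATION = re.compile(r"(?=N[^P][ST][^P])")


def sequence_properties(sequence: str) -> dict:
    """Compute simple properties of one amino acid sequence (illustrative only)."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    counts = Counter(seq)
    length = len(seq)
    return {
        "length": length,
        # Fraction of each residue type present in the sequence.
        "composition": {aa: counts[aa] / length for aa in sorted(counts)},
        "n_negative": counts["D"] + counts["E"],  # Asp + Glu
        "n_positive": counts["K"] + counts["R"],  # Lys + Arg
        # Mean hydropathy; raises KeyError on non-standard residue letters.
        "gravy": sum(KYTE_DOOLITTLE[aa] for aa in seq) / length,
        "n_glycosylation_sites": len(N_GLYCOSYLATION.findall(seq)),
    }
```

Predicted structural features (solvent accessibility, secondary structure, disorder) still have to come from the dedicated tools cited above; only their numeric outputs would be merged with a table like this one.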

FIGURE 3

CORRELATION OF THE INDIVIDUAL PROPERTIES WITH EXPRESSION SCORES

The multiple protein properties calculated and predicted using the above bioinformatics tools can be categorized into three types: yes/no, discrete, and continuous variables (Figure 4). Data processing and presentation differ for the three types of variables. The yes/no type variables, such as single-event PTMs, are features that can be either present in or absent from proteins. To present the expression data associated with these variables, bar graphs can be built that show the ratio of proteins in the expression categories A, C, and N. The graphs should represent two subsets of proteins, excluding and including the analyzed feature. The total number of sequences in each of the two subsets should be stated. Using these graphs, it is easy to make a side-by-side comparison of the data for the two subsets and deduce the tendencies in protein expression amenability associated with the analyzed feature. For discrete variables, which describe protein features observed repeatedly in the analyzed sequences, such as abundant multi-site PTMs, another type of data presentation is more convenient. In this case, the percentage of proteins in the expression categories A, C, and N is plotted at different values of the analyzed parameter, covering the entire parameter range in the dataset. In addition, the distribution of dataset proteins according to parameter values should be presented. The distribution graphs provide important information concerning the abundance of the studied protein features in the analyzed dataset. The processing of data associated with continuous variables, such as sequence hydrophobicity, solvent accessibility, content of intrinsic disorder, etc., is similar to that described for discrete variables. The graphs of A, C, and N scores, as well as the distribution graphs, should be provided over the full range of continuous feature values.
Curve smoothing is recommended to smooth out the graphs obtained with continuous variables. It can be performed using the Excel chart smoothing algorithm. Examples of data presentation for the three types of variables associated with different protein properties are provided in our recent publication (Tokmakov et al., 2014).
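The tabulation behind these graphs can be sketched as follows. This is a minimal illustration under assumptions: the function names, the record layout (a feature value paired with an A/C/N score), and the fixed-width binning of continuous variables are choices made here, not details given in the text. Plotting and curve smoothing are left out.

```python
from collections import Counter, defaultdict
from math import floor

CATEGORIES = ("A", "C", "N")


def category_percentages(scores):
    """Percentage of sequences in each expression category."""
    counts = Counter(scores)
    total = len(scores)
    if total == 0:
        return {c: 0.0 for c in CATEGORIES}
    return {c: 100.0 * counts[c] / total for c in CATEGORIES}


def compare_yes_no(records):
    """records: iterable of (has_feature, score) pairs for a yes/no variable."""
    subsets = {True: [], False: []}
    for has_feature, score in records:
        subsets[bool(has_feature)].append(score)
    # Report the subset size alongside the percentages, as the text requires.
    return {
        "with_feature": {"n": len(subsets[True]),
                         "percent": category_percentages(subsets[True])},
        "without_feature": {"n": len(subsets[False]),
                            "percent": category_percentages(subsets[False])},
    }


def profile_by_value(records, bin_width=None):
    """records: iterable of (value, score) pairs.

    Discrete variables are grouped by exact value; continuous variables are
    grouped into bins of width `bin_width` (an assumption of this sketch).
    Returns a sorted list of (value_or_bin_start, n_sequences, percentages);
    the n_sequences column is the distribution graph.
    """
    groups = defaultdict(list)
    for value, score in records:
        key = value if bin_width is None else floor(value / bin_width)
        groups[key].append(score)
    profile = []
    for key in sorted(groups):
        start = key if bin_width is None else round(key * bin_width, 10)
        profile.append((start, len(groups[key]), category_percentages(groups[key])))
    return profile
```

Sparsely populated values or bins give unstable percentages, which is one reason the distribution is reported together with the A, C, and N curves.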

FIGURE 4

STATISTICAL SIGNIFICANCE OF THE OBSERVED CORRELATIONS

The expression data processed by the proposed method represent categorical datasets, where all expressed sequences are classified into three categories – soluble (A), insoluble (C), and non-expressed (N) targets (Figure 2). Thus, to evaluate the statistical significance of the observed correlations between the multiple protein features and protein amenability to cell-free expression, categorical data analysis should be applied (Xu et al., 2010). The estimation of statistical significance should be provided for each expression category (A, C, and N). In addition, the multiple protein properties are also categorized into three types: yes/no, discrete, and continuous variables (Figure 4). Evaluation of statistical significance differs for the three types of variables. To deduce the statistical differences associated with yes/no type variables, the two-way contingency table test can be applied (Figure 5). The Fisher’s exact p-values can be computed using the tool provided online at http://statpages.org/ctab2x2.html. Usually, a confidence level of 95% is used as the threshold for rejecting the null hypothesis. To evaluate the statistical significance of expression correlations associated with the discrete variables, which have a finite number of possible values, as well as the continuous variables, Pearson’s pairwise correlation coefficients should be calculated (Figure 5). The percentage of proteins in the expression categories A, C, and N should be paired with the values of the analyzed variable over the full range of variable values observed in the dataset. Statistical significance of the correlation coefficients is validated by calculating one-tailed probability values, given the value of the correlation coefficient (r) and the sample size (n), with the significance level set to 0.05. Calculations of both correlation coefficients and p-values can be performed using the online statistics calculators available at http://www.danielsoper.com/statcalc3/.
As a general comment, it should be noted that the confidence level of categorical data analysis increases greatly with the number of sequences in the expression datasets (Norman and Streiner, 2000).
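As an alternative to the online calculators, the two tests can be sketched with the Python standard library alone. This is an illustrative sketch under assumptions, not the tools used in the paper. The function names are hypothetical, the Fisher test uses the common two-sided rule of summing all tables no more probable than the observed one, and the one-tailed p-value for r is obtained from the Student t distribution with n − 2 degrees of freedom by numerical integration.

```python
from math import comb, exp, lgamma, log, log1p, pi, sqrt


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1, n = a + b, c + d, a + c, a + b + c + d

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum over all tables that are no more probable than the observed one.
    total = sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))
    return min(1.0, total)


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def one_tailed_p(r, n, steps=10000):
    """One-tailed p-value for correlation r observed on n paired points.

    Uses t = |r| * sqrt((n - 2) / (1 - r^2)) and integrates the Student t
    density on [0, t] with Simpson's rule (steps must be even). Accuracy is
    limited for very small p-values; this is a sketch, not a reference routine.
    """
    df = n - 2
    if df < 1:
        raise ValueError("at least three paired points are required")
    if 1.0 - r * r < 1e-12:
        return 0.0  # perfect correlation within rounding error
    t = abs(r) * sqrt(df / (1.0 - r * r))
    log_norm = lgamma((df + 1) / 2) - lgamma(df / 2) - 0.5 * log(df * pi)

    def pdf(x):
        return exp(log_norm - (df + 1) / 2 * log1p(x * x / df))

    h = t / steps
    area = pdf(0.0) + pdf(t)
    area += sum((4 if i % 2 else 2) * pdf(i * h) for i in range(1, steps))
    area *= h / 3
    return max(0.0, 0.5 - area)
```

For a yes/no feature, the 2x2 table would hold, for example, the counts of score-A and non-A sequences with and without the feature; for discrete and continuous variables, `xs` would be the parameter values and `ys` the percentage of proteins in one expression category at each value, so `n` is the number of paired points.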

FIGURE 5

RESULTS AND DISCUSSION

Using the developed method, expression of 3066 human proteins and their domains in a cell-free bacterial system has been analyzed. It was found that the rate of soluble expression (score A) in the investigated dataset constituted 25.7% (Kurotani et al., 2010). This value should be considered as a benchmark, as a similar success rate has been reported for a different subset of human proteins expressed in E. coli (Ding et al., 2002). Furthermore, a number of statistically significant correlations between calculated and predicted properties of amino acid sequences and their amenability to bacterial cell-free expression have been identified using the developed approach. The most influential features that affect protein amenability to cell-free expression are listed in Table 1.

Table 1

Expression property	Soluble	Insoluble	Undetectable
Length	–	ND	+
pI	±	±	ND
Charge	+	±	–
Hydrophobicity	–	+	ND
Solvent accessibility	+	–	+
Secondary structure	+	±	–
Intrinsic disorder	+	–	+
Protein domains	–	–	+
S–S bonds	–	+	+
Coiled coil	+	–	–
Transmembrane seqs	–	–	+
Localization signals	–	+	ND
PEST regions	+	–	+
Prenylation	+	ND	ND
Phosphorylation	+	–	–
Asn glycosylation	–	+	ND
Palmitoylation	–	±	+
Ubiquitination	+	–	ND
SUMOylation	+	–	±
Amidation	ND	ND	ND
Asx hydroxylation	ND	ND	ND
Sulfation	ND	ND	ND

Correlations of cell-free protein expression with calculated and predicted properties of amino acid sequences.

The signs (+) and (–) indicate positive and negative correlations, respectively; (±) refers to the opposite tendencies of expression estimates at different values of calculated parameters; and ND denotes the lack of correlation.

Notably, some of these features, such as protein pI, hydrophobicity, presence of localization signals, etc., are mostly related to protein solubility, whereas the others, such as protein length, charge, solvent accessibility, presence of S–S bonds, transmembrane sequences, PEST regions, etc., also affect the overall expression propensity. The presence of some specific sequence motifs was found to be one of the most discriminative parameters for expression propensity. The correlations revealed can be of practical use for protein engineering with the aim of increasing expression success. The rationales for these correlations are discussed in detail in the published paper (Kurotani et al., 2010).

In addition, it was found that amenability of human polypeptide sequences to bacterial cell-free expression correlates with the presence of multiple PTM sites bioinformatically predicted in these sequences (Tokmakov et al., 2012; Table 1). Surprisingly, the presence of predicted sites for several PTMs, such as ubiquitination, SUMOylation, etc. (Table 1), was associated with increased production of properly folded soluble protein. However, no SUMOylation and ubiquitination machineries are known to exist in bacteria, suggesting that the presence of these PTM sites in amino acid sequences is related to intrinsically better protein solubility even in the absence of the modifications. It was hypothesized that physicochemical and/or structural characteristics of the modification sites themselves convey the better solubility (Tokmakov et al., 2012). Altogether, these findings indicate that identification of potential PTM sites in polypeptide sequences can be of practical use for predicting expression success and optimizing heterologous protein synthesis. Currently, a discriminant-based machine-learning algorithm that utilizes multiple features of amino acid sequences to predict the success rate of heterologous protein synthesis is being developed based on the reported findings. The algorithm will provide a basis for the internet-based tool for predicting amenability of eukaryotic proteins to cell-free expression in a prokaryotic system.

Statements

Acknowledgments

This work was supported by the research fund for Foreign Visiting Professor from Kobe University and the Grant-in-Aid for Scientific Research from the Ministry of Education, Culture, Sports, Science and Technology of Japan (no. 25440023).

Conflict of interest

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Footnotes

1. http://www.expasy.org/tools/

2. http://scratch.proteomics.ics.uci.edu/explanation.html

3. http://mobyle.pasteur.fr/cgi-bin/portal.py?#forms::predator

4. http://emboss.sourceforge.net/apps/cvs/emboss/apps/pepcoil.html

5. http://www.strubi.ox.ac.uk/RONN

6. http://emboss.bioinformatics.nl/cgi-bin/emboss/pestfind

7. http://www.cbs.dtu.dk/services/SignalP/

8. http://harrier.nagahama-i-bio.ac.jp/sosui/

9. http://csspalm.biocuckoo.org

REFERENCES

Bertone, P., Kluger, Y., Lan, N., Zheng, D., Christendat, D., Yee, A., et al. (2001). SPINE: an integrated tracking database and data mining approach for identifying feasible targets in high-throughput structural proteomics. Nucleic Acids Res. 29, 2884–2898. doi: 10.1093/nar/29.13.2884
Chen, R. (2012). Bacterial expression systems for recombinant protein production: E. coli and beyond. Biotechnol. Adv. 30, 1102–1107. doi: 10.1016/j.biotechadv.2011.09.013
Cheng, J., Randall, A. Z., Sweredoski, M. J., and Baldi, P. (2005). SCRATCH: a protein structure and structural feature prediction server. Nucleic Acids Res. 33, W72–W76. doi: 10.1093/nar/gki396
Cheng, J., Saigo, H., and Baldi, P. (2006). Large-scale prediction of disulfide bridges using kernel methods, two-dimensional recursive neural networks, and weighted graph matching. Proteins 62, 617–629. doi: 10.1002/prot.20787
Ding, H. T., Ren, H., Chen, Q., Fang, G., Li, L. F., Li, R., et al. (2002). Parallel cloning, expression, purification, and crystallization of human proteins for structural genomics. Acta Crystallogr. D Biol. Crystallogr. 58, 2102–2108. doi: 10.1107/S0907444902016359
Dyson, M. R., Shadbolt, S. P., Vincent, K. J., Perera, R. L., and McCafferty, J. (2004). Production of soluble mammalian proteins in Escherichia coli: identification of protein features that correlate with successful expression. BMC Biotechnol. 4:32. doi: 10.1186/1472-6750-4-32
Frishman, D., and Argos, P. (1997). Seventy-five percent accuracy in protein secondary structure prediction. Proteins 27, 329–335. doi: 10.1002/(SICI)1097-0134(199703)27:3<329::AID-PROT1>3.0.CO;2-8
Goh, C. S., Lan, N., Douglas, S. M., Wu, B., Echols, N., Smith, A., et al. (2004). Mining the structural genomics pipeline: identification of protein properties that affect high-throughput experimental analysis. J. Mol. Biol. 336, 115–130. doi: 10.1016/j.jmb.2003.11.053
He, M. (2008). Cell-free protein synthesis: applications in proteomics and biotechnology. N. Biotechnol. 25, 126–132. doi: 10.1016/j.nbt.2008.08.004
Idicula-Thomas, S., and Balaji, P. V. (2005). Understanding the relationship between the primary structure of proteins and its propensity to be soluble on overexpression in Escherichia coli. Protein Sci. 14, 582–592. doi: 10.1110/ps.041009005
Katzen, F., Chang, G., and Kudlicki, W. (2005). The past, present and future of cell-free protein synthesis. Trends Biotechnol. 23, 150–156. doi: 10.1016/j.tibtech.2005.01.003
Kigawa, T., Matsuda, T., Yabuki, T., and Yokoyama, S. (2008). “Bacterial cell-free system for highly efficient protein synthesis,” in Cell-Free Protein Synthesis, eds A. S. Spirin and J. R. Swartz (Weinheim: Wiley VCH), 83–97.
Kigawa, T., Yabuki, T., Yoshida, Y., Tsutsui, M., Ito, Y., Shibata, T., et al. (1999). Cell-free production and stable-isotope labeling of milligram quantities of proteins. FEBS Lett. 442, 15–19. doi: 10.1016/S0014-5793(98)01620-2
Kurotani, A., Takagi, T., Toyama, M., Shirouzu, M., Yokoyama, S., Fukami, Y., et al. (2010). Comprehensive bioinformatics analysis of cell-free protein synthesis: identification of multiple protein properties that correlate with successful expression. FASEB J. 24, 1095–1104. doi: 10.1096/fj.09-139527
Lupas, A., Van Dyke, M., and Stock, J. (1991). Predicting coiled coils from protein sequences. Science 252, 1162–1164. doi: 10.1126/science.252.5009.1162
Norman, G. R., and Streiner, D. L. (2000). Biostatistics: The Bare Essentials. Hamilton: B. C. Decker.
Radivojac, P., Vacic, V., Haynes, C., Cocklin, R. R., Mohan, A., Heyen, J. W., et al. (2010). Identification, analysis, and prediction of protein ubiquitination sites. Proteins 78, 365–380. doi: 10.1002/prot.22555
Ren, J., Gao, X., Jin, C., Zhu, M., Wang, X., Shaw, A., et al. (2009). Systematic study of protein sumoylation: development of a site-specific predictor of SUMOsp 2.0. Proteomics 9, 3409–3412. doi: 10.1002/pmic.200800646
Ren, J., Wen, L., Gao, X., Jin, C., Xue, Y., and Yao, X. (2008). CSS-Palm 2.0: an updated software for palmitoylation sites prediction. Protein Eng. Des. Sel. 21, 639–644. doi: 10.1093/protein/gzn039
Sivashanmugam, A., Murray, V., Cui, C., Zhang, Y., Wang, J., and Li, Q. (2009). Practical protocols for production of very high yields of recombinant proteins using Escherichia coli. Protein Sci. 18, 936–948. doi: 10.1002/pro.102
Sorensen, H. P., and Mortensen, K. K. (2005). Advanced genetic strategies for recombinant protein expression in Escherichia coli. J. Biotechnol. 115, 113–128. doi: 10.1016/j.jbiotec.2004.08.004
Spirin, A. S. (2004). High-throughput cell-free systems for synthesis of functionally active proteins. Trends Biotechnol. 22, 538–545. doi: 10.1016/j.tibtech.2004.08.012
Tokmakov, A. A., Kurotani, A., Shirouzu, M., Fukami, Y., and Yokoyama, S. (2014). Bioinformatics analysis and optimization of cell-free protein synthesis. Methods Mol. Biol. 1118, 17–33. doi: 10.1007/978-1-62703-782-2_2
Tokmakov, A. A., Kurotani, A., Takagi, T., Toyama, M., Shirouzu, M., Fukami, Y., et al. (2012). Multiple post-translational modifications affect heterologous protein synthesis. J. Biol. Chem. 287, 27106–27116. doi: 10.1074/jbc.M112.366351
Xu, B., Feng, X., and Burdine, R. D. (2010). Categorical data analysis in experimental biology. Dev. Biol. 348, 3–11. doi: 10.1016/j.ydbio.2010.08.018
Yabuki, T., Motoda, Y., Hanada, K., Nunokawa, E., Saito, M., Seki, E., et al. (2007). A robust two-step PCR method of template DNA production for high-throughput cell-free protein synthesis. J. Struct. Funct. Genomics 8, 173–191. doi: 10.1007/s10969-007-9038-z
Yang, Z. R., Thomson, R., McNeil, P., and Esnouf, R. M. (2005). RONN: the bio-basis function neural network technique applied to the detection of natively disordered regions in proteins. Bioinformatics 21, 3369–3376. doi: 10.1093/bioinformatics/bti534
Yokoyama, S. (2003). Protein expression systems for structural genomics and proteomics. Curr. Opin. Chem. Biol. 7, 39–43. doi: 10.1016/S1367-5931(02)00019-4

Summary

Keywords

cell-free protein synthesis, protein solubility, physicochemical and structural protein properties, categorical data analysis, correlation analysis

Citation

Tokmakov AA (2014) Identification of multiple physicochemical and structural properties associated with soluble expression of eukaryotic proteins in cell-free bacterial extracts. Front. Microbiol. 5:295. doi: 10.3389/fmicb.2014.00295

Received

10 April 2014

Accepted

29 May 2014

Published

20 June 2014

Volume

5 - 2014

Edited by

Salvador Ventura, Universitat Autonoma de Barcelona, Spain

Reviewed by

George-John Nychas, Agricultural University of Athens, Greece; Kirill Alexandrov, University of Queensland, Australia

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Alexander A. Tokmakov, Research Center for Environmental Genomics, Kobe University, Rokko dai 1-1 Nada, Kobe, Hyogo 657-8501, Japan e-mail: tokmak@phoenix.kobe-u.ac.jp

This article was submitted to Microbial Physiology and Metabolism, a section of the journal Frontiers in Microbiology.
