Calcium-Dependent Protein Kinases from Arabidopsis Show Substrate Specificity Differences in an Analysis of 103 Substrates

The identification of substrates represents a critical challenge for understanding any protein kinase-based signal transduction pathway. In Arabidopsis, there are more than 1000 different protein kinases, 34 of which belong to a family of Ca2+-dependent protein kinases (CPKs). While CPKs are implicated in regulating diverse aspects of plant biology, from ion transport to transcription, relatively little is known about isoform-specific differences in substrate specificity, or the number of phosphorylation targets. Here, in vitro kinase assays were used to compare phosphorylation targets of four CPKs from Arabidopsis (CPK1, 10, 16, and 34). Significant differences in substrate specificity for each kinase were revealed by assays using 103 different substrates. For example CPK16 phosphorylated Serine 109 in a peptide from the stress-regulated protein, Di19-2 with KM ∼70 μM, but this site was not phosphorylated significantly by CPKs 1, 10, or 34. In contrast, CPKs 1, 10, and 34 phosphorylated 93 other peptide substrates not recognized by CPK16. Examples of substrate specificity differences among all four CPKs were verified by kinetic analyses. To test the correlation between in vivo phosphorylation events and in vitro kinase activities, assays were performed with 274 synthetic peptides that contained phosphorylation sites previously mapped in proteins isolated from plants (in vivo-mapped sites). Of these, 74 (27%) were found to be phosphorylated by at least one of the four CPKs tested. This 27% success rate validates a robust strategy for linking the activities of specific kinases, such as CPKs, to the thousands of in planta phosphorylation sites that are being uncovered by emerging technologies.

. CRKs differ in having a degenerate calmodulin-like domain containing substitutions expected to disrupt the Ca 2+ binding EF hands (Harper et al., 2004). Nevertheless, at least some CRKs can be activated by Ca 2+ through a bimolecular interaction with calmodulin (Jeong et al., 2007).
Despite being implicated in so many diverse aspects of plant and protozoan biology, relatively little is known about the downstream substrates regulated by CPKs. CPKs are expected to be multifunctional, with many target substrates. This speculation is supported by analogy to CaM-dependent kinases in animals (CaMKs), as well as an expanding list of candidate substrates tested by in vitro kinase reactions (e.g., Harper et al., 2004;Klimecka and Muszynska, 2007;Kanchiswamy et al., 2010).
While most candidate substrates for CPKs have been identified through in vitro kinase reactions, there are several for which there is evidence of in vivo phosphorylation or a kinase-substrate interaction, including the following. A tobacco CDPK1 has been implicated in regulating the phospho-dependent binding between a 14-3-3 protein and a transcription factor (RSG) involved in mediating GA signaling (Ishida et al., 2008;Ito et al., 2010). Isoforms AtCPK21 and 23 have been implicated in binding and activating a guard cell anion channel SLAC1 (Geiger et al., 2010). For AtCPK32, evidence supports an interaction with an ABA-induced transcription factor (ABF4) that can alter ABA-triggered responses (Choi et al., 2005). A soybean CPK has been linked to the regulation of a serine acetyltransferase in response to oxidative stress (Liu et al., 2006). Nitrate reductase (NR) activity can be regulated by CPK phosphorylation, which promotes a phospho-dependent binding interaction with 14-3-3 proteins (Bachmann et al., 1996;Douglas et al., 1997Douglas et al., , 1998Lambeck et al., 2010).
Multiple strategies have been used to help identify substrates for CPKs and other kinases in plants (Peck, 2006;de la Fuente van Bentem et al., 2008;Kersten et al., 2009;Uno et al., 2009;Huang et al., 2010;Mehlmer et al., 2010). While a "one protein at a time" strategy is still relevant, proteome-wide surveys employing mass spectrometry have rapidly expanded the list of mapped phosphorylation sites in plant proteomes since 2003 (Nuhse et al., 2003). An important aspect of these surveys is their potential to identify phosphorylation sites occurring in vivo (in vivo-mapped sites). However, correlating these data with the specific kinases responsible for each phosphorylation event remains a challenge. To this end, custom-designed peptide libraries provide a strategy to rapidly test large numbers of in vivo-mapped phosphorylation sites for their potential to be phosphorylated by a specific kinase.
Here we present results from three complementary strategies used to help identify CPK substrates. The first approach was a survey of candidate proteins using in vitro kinase reactions, followed by phosphorylation site mapping using mass spectrometry. The second approach was to test 274 in vivo-mapped phosphorylation sites (Peck, 2006;de la Fuente van Bentem et al., 2008;Kersten et al., 2009;Uno et al., 2009;Huang et al., 2010;Mehlmer et al., 2010) for their potential to be phosphorylated in vitro by a particular CPK. The third approach was to test sites that were predicted based on consensus phosphorylation site information. For example, 206 of the predicted sites tested here were derived from proteins with evidence for a 14-3-3 interaction (Chang et al., 2009). Potential 14-3-3 binding sites were considered logical candidates for CPK substrates because a consensus site proposed for CPK phosphorylation overlaps with many phospho-sites implicated in 14-3-3 binding interactions (Johnson et al., 2010). Because most 14-3-3 interactions are thought to be mediated by phosphorylation, this focus was postulated to enrich for potential CPK sites of biological relevance.
Using peptides derived from both in vivo mapping and bioinformatic prediction strategies, we identified 89 new peptide substrates that were recognized by at least one of four CPKs tested (isoforms 1, 10, 16, and 34). These new peptide substrates, in addition to 13 reference peptides, revealed significant differences in substrate specificity for each of the four CPKs. Isoform CPK16 recognized the fewest substrates (nine), whereas CPK34 recognized the most (97 total, 31 of which were isoform-specific). This observed diversity of substrate specificity helps explain the origin of more than seven different CPK consensus sites that have been proposed from different studies based on different isoforms (Harper et al., 2004). This survey also supports a model for CPK signaling networks in which substrate specificity contributes to the isoform-specific functions of different CPKs. results dI19-2 Is a good substrate for cPK16, but not cPK1, 10, and 34 To search for potential CPK16 substrates, we initiated a survey of protein and peptide substrates using in vitro kinase assays. One group of proteins tested included members of a family of droughtinduced proteins (Di19s; Milla et al., 2006;Rodriguez Milla et al., 2006; data not shown). Some Di19s had previously been found to interact with CPKs in a yeast two-hybrid screen (Rodriguez Milla et al., 2006). Isoform Di19-2 was chosen for further analysis because it was strongly phosphorylated by CPK16 (Figure 1). In contrast, assays of Di19-2 with three other CPKs (1, 10, and 34) showed more than 10-fold lower levels of phospho-labeling.
All four CPKs were further tested in parallel reactions with a fusion protein containing a known CPK phosphorylation site found in NR. In this case, CPK16 showed weaker substrate labeling compared with the other three CPKs. In contrast, the phosphorylation of substrate Di19-2 site S109 was more than 10-fold stronger for CPK16 than for the other three kinases (in Figure 1, compare lane NR with Di19-2 site S109 wt; discussed in more detail below). This inverse relationship of relative labeling efficiencies indicated that CPK16 had a distinct substrate preference compared to CPKs 1, 10, and 34.
To identify CPK16 phosphorylation sites in Di19-2, a fusion protein was phosphorylated, digested by trypsin or Glu-C, and analyzed by mass spectrometry. Residue S109 was identified as a phosphorylation site (data not shown). This site was subsequently confirmed using a fusion protein substrate that displayed the region from D99 to Y115 (described below). CPK16 also phosphorylated additional sites in Di19-2, as indicated by the observation that phosphorylation was still detected in a full-length Di19-2 fusion protein in which the S109 site was substituted with an alanine (S109A, data not shown).
To conduct a more detailed analysis on the S109 site, a fusion protein (G-S109-ST2) was engineered that displayed a fragment from D99 to Y115 (DVLKS 103 EQKEMS 109 YREDPY), flanked of S109 to A (S > A) was shown to eliminate any detectable labeling, consistent with S109 serving as the target phosphorylation site (Figure 4, and see Figure 1 to compare lane S > A to wt). Second, a substitution of S103 to A (a potential site upstream of S109) failed to result in any decrease in phosphorylation, confirming that S103 was not needed for substrate labeling. These results confirmed that fusion protein G-S109-ST2 has a single CPK16 phosphorylation site at S109.
To further define the residues surrounding S109 that were important for CPK16 phosphorylation, 26 additional site-specific substitutions were introduced and tested for their effect on by two purification tags -an upstream GST-GFP (Glutathione S-Transferase-Green Fluorescence Protein), and a downstream Strep-2 tag (Figure 2). For comparison, a parallel fusion protein was created with an established CPK phosphorylation site from NR (Figure 2, NR sequence).
As expected for a CPK substrate, phosphorylation of the G-S109-ST2 fusion protein was stimulated by Ca 2+ (Figure 3). This fusion protein substrate was consistently purified as mixture of two proteins (approximately 60 and 30 kD; Figure 3). The larger protein migrated at the predicted size for the engineered GST-GFPpeptide fusion. The smaller fragment was assumed to be a product of proteolysis. As shown below, both fragments were found to have a single CPK16 phosphorylation site.

Figure 3 | Kinase reactions
showing Ca 2+ -dependent phosphorylation of a Di19-site-S109 fusion protein (g-gFP-S109-ST) by CPK16. Three kinase reactions are shown on the left side after separation on an SDS-12% polyacrylamide gel and staining with Coomassie Brilliant Blue (1s,2s, 3s). The same gel is shown to the right as an autoradiogram to show the 32 P-labeling of phosphorylated proteins (1r, 2r, 3r). Kinase reactions were conducted as described in Section "Materials and Methods, " plus (+) or minus (−) Ca 2+ , with 1 μg of fusion protein substrate, 0.01 μg of kinase, in a 10-μL reaction. After a 10-min incubation, the entire reaction was subjected to SDS-PAGE. In control reactions, CPK16 did not phosphorylate the GST-GFP portion of the Di19-2-site-S109 fusion protein (e.g., see Figure 1). Note that the purified G-GFP-S109-ST fusion protein runs as two bands with apparent molecular weights of 48 and 60 kD. Results shown are representative of multiple independent analyses. Figure 2 | Diagram of the fusion protein system used for analysis of CPK phosphorylation sites from Di19-2 (G-S109-ST, ps1160) and nitrate reductase (ps699). In some experiments, a similar fusion system was used in which the GFP was replaced by an RFP (G-RFP-NR-ST, ps630).
in phosphorylation was observed with all four substitutions. This included a conservative substitution of R to K. In total, the analysis of these 26 substitutions supports a classification of site S109 as a specific variation of a CPK "simple-2" consensus site (Harper and Harmon, 2005), which was defined as (S/T) × (Basic). The variation here can be described as (S/T; F preferred, D detrimental; R preferred over K).
The G-S109-ST2 fusion protein was further tested for phosphorylation by three other CPKs (isoforms 1, 10, and 34; Figure 1, lane Di19-2 site S109 wt). All three failed to provide significant phosphorylation. In side-by-side controls, all three CPKs were confirmed to be as active as CPK16, as indicated by their relatively stronger phosphorylation of a fusion protein harboring a site from NR (see lane NR in Figure 1). The failure of CPK1, 10, and 34 to phosphorylate S109 is consistent with the original observation that the fulllength Di19-2 was a poor substrate for these three CPK isoforms.
The K M for the phosphorylation of S109 by CPK16 was determined to be approximately 70 μM, when assayed in the context of the G-S109-ST2 fusion shown in Figure 2. Surprisingly however, no significant phosphorylation was obtained using a synthetic 15-amino acid peptide containing the S109 site at the center of the peptide. It is not clear if the synthetic peptide failed because of issues with secondary structure, solubility, or the inclusion of a norleucine (NorLeu) substitution at residue M108 (NorLeu was used in synthetic peptides to replace methionines with a nonoxidizable side chain).

PePtIde substrate survey
To conduct a more extensive survey for potential CPK substrates, we used four CPKs and compared their relative ability to phosphorylate 713 peptides, of which 179 represented experimental variations from the core list of 534 peptides (Table S1 in Supplementary Material). This library of synthetic peptides was generated based on five criteria: (1) A subset of 274 peptides were based on in vivo phosphorylation sites mapped in proteins isolated from plant extracts (Nuhse et al., 2003(Nuhse et al., , 2004(Nuhse et al., , 2007Peck, 2006;Benschop et al., 2007;and Harper, unpublished); (2) 206 peptides were based on predicted 14-3-3 phospho-dependent binding sites in proteins for which evidence had previously been obtained for an in vivo interaction with a 14-3-3 (Chang et al., 2009); (3) 14 peptides were based on mapped sites from in vitro phosphorylation reactions with CPKs; (4) 15 peptides were designed based on bioinformatic predictions of potential CPK consensus phosphorylation sites showing strong conservation between rice and Arabidopsis (Harper, unpublished); and (5) the remainder were reference peptides corresponding to phosphorylation sites established for a variety of plant and animal kinases.
The relative phospho-labeling of peptides was compared for each CPK using a scintillation proximity assay (SPA). Of the 534 peptides tested, 102 ("hits") were found to show significant labeling by at least one of the four CPKs tested ( Table 1). Of these, 13 were reference peptides, 74 peptides contained phosphorylation sites with evidence for in vivo phosphorylation, and the remaining 15 were predicted sites.
Note that the SPA assays were used here as a survey tool. As it was not practical to conduct the SPA assays under Michaelis-Menten conditions, these assays do not represent a rigorous test of phosphorylation (Figure 4). Two positions appeared very sensitive to changes. At position +1 (Y110), a more than two-fold decrease in phosphorylation was observed by the substitution of a Y to D (acidic), but not a Y to K (basic). This indicates that a negative charge in this location is detrimental. However, a Y to F substitution showed a greater than three-fold improvement in phosphorylation. Because alternative hydrophobic substitutions of W or I failed to show similar improvements, the Y to F substitutions identifies a specific hydrophobic residue that is highly preferred at this location. At position +2 (R111; Figure 4), a more than two-fold decrease Figure 4 | Kinase reactions with CPK16 comparing relative phospholabeling of Di19-2-site-S109 and variants. (A) For each amino acid substitution, the fold change in phosphorylation is shown in comparison to phosphorylation levels for the wild type version of the S109 site encoded in G-GFP-S109-ST. All substitutions were engineered by site-specific mutagenesis of the same parental clone, G-GFP-S109-ST. All reactions, SDS-PAGE gels, and autoradiograms were performed as described in Figure 2 and in Section "Materials and Methods. " The percentage incorporation of 32 P was determined by analysis of autoradiograms and was normalized to the amount of Coomassie Brilliant Blue-stained protein for each substrate. Results shown are the average of at least four independent analyses for each variant. Error bars show SD. (B) A summary of changes in phosphorylation as a result of substitutions. Peptides with similar phosphorylation potentials are shown first (i.e., changes less than twofold compared to wild type), followed by those that were higher or lower by more than twofold.

IdentIfIcatIon of PhosPho-sItes by "PePtIde maPPIng" wIth alanIne substItutIons
Because most of the 102 synthetic peptide hits contained multiple Ss or Ts, the specific phosphorylation site(s) could not be assigned from the initial survey reactions. For a subset of positive hits, additional mutant peptides were synthesized in which selected S or T residues were replaced with alanines. Using this approach, 43 phospho-sites were mapped to a specific S or T (indicated by an asterisk in Table 1). An additional 12 peptides contained only a single S or T, in a total of 55 peptides in which specific phospho-sites were established. Of these, only one site did not conform to either a "simple-1" consensus site [basic-X-X-(S/T)] or a "simple-2" site [(S/T)-Xbasic]. The exception was jfh-149 (RTAFLSQY[S] * FYKSLL), which was derived from an in vivo-mapped phospho-site in a lipid flippase, ALA2 (Harper lab, unpublished). However, the low frequency of non-consensus sites does not necessarily indicate that CPKs are more likely to conform to these simple consensus sites, as nearly all the peptides tested here were selected because they already contained a proposed CPK consensus site. This "mutant-peptide" mapping strategy also corroborated significant differences in substrate specificity determinants for each of the CPKs tested. For six peptides with various alanine-substituted versions, differential effects on substrate recognition were observed among CPKs 1, 10, and 34 ( Table 2). In eight cases, recognition by two of the three isoforms was lost, and in four cases recognition was lost by only one isoform. For example, a mutant version of jfh-579 (i.e., jfh-688) revealed a disruption of phosphorylation by CPK10, but not by CPK1 and CPK34; and a mutant version of jfh-458 (i.e., jfh-723) provided a disruption of phosphorylation by CPK1, but not by CPK34. Table 3 shows SPA results for variations on peptide jfh-515, which was recognized by all four CPKs tested, including CPK16. This peptide contained three Ss, the central one identified as an in vivo phosphorylation site. By mutating the central S (jfh-734), we found that at least one of the flanking Ss was still labeled by all four CPKs. However, when both flanking S residues were changed to a peptide's kinetic properties as a substrate. Thus, a relatively small change in cut-off thresholds could change a particular substrate's designation as a positive or negative. For this reason, selected peptides were chosen for follow-up assays to verify examples of substrate specificity (discussed below). In contrast to the SPA assays, the standard filter-based assays were done under Michaelis-Menten conditions. These assays (Tables 4 and 5) confirmed that the SPA assay conditions used here provided a reasonable indicator of a peptide's relative strength as a CPK substrate.
Of the 102 peptide substrate hits in the SPA assay, the largest subset (33) were phosphorylated by three CPKs: isoforms 1, 10, and 34. However, each of these CPKs phosphorylated a distinct subset of shared and isoform-specific peptides, supporting a conclusion that each isoform has unique substrate specificity. CPK34 had the largest number of isoform-specific substrates (31), whereas CPKs 10 and 1 had only 3 and 2 substrates, respectively.
CPK16 phosphorylated the fewest synthetic peptides (nine). The most strongly labeled peptides included two peptides from a ubiquitin ligase (ATL6, AT3G05200, peptides jfh-495 and -496) and a protein of unknown function (AT3G27390, peptide jfh-515), all three of which contained a putative CPK site identified through mapping surveys of in vivo phosphorylated proteins. Another was jfh-097, a peptide containing a predicted 14-3-3 binding site from a glutamate receptor 3.7 (AT2G32400), previously identified as a 14-3-3 interactor (Chang et al., 2009). The only reference peptide to show significant labeling was the non-plant peptide sequence MARCKS (ach-400).
Of the nine synthetic peptide substrates for CPK16, all were phosphorylated by at least one of the other three CPKs tested (i.e., substrates were not CPK16-specific). However, because each of these peptides contain multiple serines, it was not clear if CPK16 recognized the same or a different phosphorylation site. In one case (peptide jfh-097), Edman degradation was used to confirm that the same serine could be phosphorylated by CPK16 and CPK34 [in the tryptic fragment, RTS(pS)MP]. This site matched a "simple-1" consensus motif, indicating that CPK16 is not restricted to a "simple-2" consensus site (like the S109 site of Di19-2, Figure 4). Relative phosphorylation levels are shown as measured by SPA assays. Values represent the fold-increase over "no peptide" controls for each CPK (average of at least two independent replicates). Peptides are sorted according to degree of phosphorylation by CPK34, except in six cases where a comparison to a closely related peptide is shown immediately after the more highly phosphorylated example. A dash "-" indicates that the fold increase was "not significant. Table S1 in Supplementary Material). Residues in parentheses ( ) correspond to sites predicted based on in vivo mapping evidence from a homolog (see Table S1 in Supplementary Material). Underlined and asterisk labeled residues were shown to be phosphorylated by CPK34 (and others when indicated) by either peptide mapping or when that residue represents the only serine or threonine residue in the peptide. Underlined residues without asterisk represent the candidate sites based on available evidence. For sequences from Arabidopsis proteins, the AGI# identifies the corresponding gene. Table S1 in Supplementary Material. substrate, reactions with CPK16 showed significantly weaker K M s (as much as 40-fold) compared to CPK34. However, CPK16 reactions showed higher turnover rates (k cat ), resulting in catalytic efficiencies for CPK34 and CPK16 that were within five-fold of each other. For CPK16, the best catalytic efficiency was observed with the ach-400 peptide (MARKS), which was approximately five-fold greater than with CPK34. Similar kinetic differences were observed in reactions comparing two peptides recognized by CPKs 1, 10, and 34 (ach-393/syntide-2 and jfh-597; Table 5). The peptide jfh-597 was derived from a K + channel, GORK (jfh-317), and is modified with an alanine substitution that leaves a single S residue corresponding to the location of a phospho-S in a potential 14-3-3 binding site. Consistent with the alanine (leaving only the central mapped pS; peptide jfh-735), phosphorylation occurred with CPKs, 1, 10, and 34, but not with CPK16. The failure of CPK16 to recognize this double alanine-substituted version is consistent with it having distinct substrate specificity.

examPles of KInetIc dIfferences between cPKs 16, 1, 10, and 34
To confirm the qualitative differences observed here for substrate specificity among the four CPKs tested, we compared reactions with selected substrates and kinases for differences in K M s, k cat s, and catalytic efficiencies under standard Michaelis-Menten conditions.
In analyses using three peptides recognized by CPK16 and 34 (jfh-097, ach-400, and jfh-515), significant differences where observed in multiple kinetic parameters ( Table 4). For each  the kinase domains are more similar to a family of CPK-related kinases (CRKs) than to the core group of 31 other Arabidopsis CPKs (Harper et al., 2004). Nevertheless, CPK16 displays Ca 2+ -stimulated activity (Figure 3) and contains a typical calmodulin-like regulatory domain -the structural feature that initially defined the CPK family. CPK16's unique substrate specificity was revealed initially by analysis of a Di19-2 fusion protein. This fusion protein was strongly phosphorylated by CPK16, but more weakly by CPK1, 10, or 34 (Figure 1). A fusion protein G-S109-ST2 was engineered to display just one of the CPK16 sites present in the Di19-2 protein. This substrate had a K M of 70 μM in reactions with CPK16, but failed to show any significant phosphorylation by the other three CPKs. In contrast, CPK16 only weakly recognized a site from NR, which was strongly phosphorylated by the other three kinases (Figure 1). This comparison showed that CPK16 had a substrate preference contrary to that of CPKs 1, 10, and 34. In another recent study, NR was found to be a good substrate for AtCPK isoforms 17 and 2, but not for CPKs 16, 24, or 29 (Lambeck et al., 2010). This supports the expectation that CPK16 and other CPKs will show significant differences in substrate specificity.

Relative phosphorylation levels are shown as determined by a standard SPA assay. Values shown represent the average of at least two independent assays. Position of predicted or in vivo mapped sites are designated by ( ) or [ ] respectively. Underlined residues in mutant peptides show potential phosphorylation sites. S or Ts with evidence of phosphorylation are shown in bold. Small case "a" indicates an S to A substitution.
The extent to which the substrate specificity of CPK16 differed was further demonstrated here by a survey of 534 unique peptide substrates ( Table 1). From the 102 synthetic peptide hits that were identified as in vitro substrates for at least one of the four CPKs, only nine showed significant phosphorylation by CPK16. By comparison, the next lowest number of substrates identified, 41, was found for CPK10. While all nine synthetic peptide substrates for CPK16 were also recognized by CPK34, the presence of multiple S/Ts in each peptide leaves open the possibility that each enzyme might still recognize other sites within a given peptide.
Substrate specificity differences were also revealed by comparison of CPK1, 10, and 34. In the survey of peptide substrates using an SPA assay (Table 1), CPK34 had the highest number of isoform-specific substrates (32), whereas CPK10 had only three. While CPK1 showed only two isoform-specific substrates, its 64 positive substrate hits were a unique mixture of those recognized by the other three isoforms. In follow-up assays on five peptides, a comparison of kinetic parameters with each of the CPKs illustrated reactions showing greater than 10-fold differences in K M s and k cat s (Tables 4 and 5). This indicates that each of these four enzymes have significant differences in substrate recognition and turnover (i.e., substrate specificity).

the molecular basIs of substrate sPecIfIcIty dIfferences
The structural basis for CPK substrate recognition is not yet clear. Multiple features of the kinase might contribute to specificity, from various amino acid residues associated with the substrate binding groove in the kinase domain, to various features associated with the N-and C-terminal domains interacting with substrates. For NtCDPK1, the variable N-terminal domain can be transferred to AtCPK9, thereby altering the substrate specificity of CPK9 to match that of CDPK1 (Ito et al., 2010). The crystal structures of apicomplexan CPKs suggest a possible interaction of the C-terminal regulatory domain with a "latch" in the N-terminal domain (Wernimont et al., 2010(Wernimont et al., , 2011. All of the intramolecular interactions that occur during activation of a CPK might also influence substrate recognition and turnover kinetics. notion that each of these CPKs has different substrate specificities, each peptide/kinase reaction displayed diverse kinetic parameters, with as much as a 10-fold difference in K m s and k cat s, and a two-to eight-fold range in catalytic efficiencies. dIscussIon substrate sPecIfIcIty dIfferences for cPKs 1, 10, 16, and 34 CPK isoforms 1, 10, 16, and 34 represent 4 of the 12 different subgroups of CPKs found in flowering plants (Harper et al., 2004). Three lines of evidence presented here support a model in which CPKs from different subgroups are likely to have functionally distinct substrate specificities. First, each of the four CPKs recognized unique subsets of substrates (see the 102 peptides in Table 1 and protein Di19-2 in Figure 1). Second, alanine substitutions in seven selected peptides uncovered different substrate recognition determinants for each of the four kinases (Tables 2 and 3). Finally, kinetic analyses using five selected peptides showed more than 10-fold differences in K M s and k cat s for each of the four different CPKs (Tables 4 and 5).
The most distinct substrate specificity was observed for CPK16 (Table 1 and Figure 1). CPK16 is a representative of the most divergent subgroup of CPKs in Arabidopsis. In this subgroup,  locations, as indicated by DsRED2 tagging experiments . Because CPK16 appears to be located at the plasma membrane, the phosphorylation of cytosolic Di19-2 might provide a pathway to relay Ca 2+ signals from the plasma membrane to the nucleus. The observed 70 μM K M for the phosphorylation of the S109 site by CPK16 is consistent with substrate affinities expected for biologically relevant phospho-regulation (Harper et al., 2004). Interestingly, Di19-2 is not recognized as a good substrate by at least five other CPKs isoforms, including 1, 10, 34 (Figure 1), 3, and 11 . In contrast, the other five members of the Di19 family in Arabidopsis were shown to be good substrates for multiple CPK isoforms. Thus, a potential CPK-mediated phosphoregulation of Di19-2 may require a CPK16-like substrate specificity.

modulatIon of PhosPhorylatIon by methIonIne oxIdatIon
The Huber lab recently documented an example of a CPK phosphorylation site in NR (similar to peptide jfh-225) in which phosphorylation is blocked by the oxidation of a flanking methionine (e.g., P + 4 position; Hardin et al., 2009). Because thioether groups of methionines can be oxidized to sulfoxide groups by ROS such as H 2 O 2 , this modification (termed MetSO) provides a mechanism to transform an ROS signal into an on-off switch for a specific phosphosignaling pathway. Because plant cells have enzymes that can reduce MetSO back to Met, this regulatory switch is potentially reversible. Of the 102 synthetic peptide substrates identified here for CPKs, 24 have methionines near the phosphorylation site. However, it is important to note that with one exception, each of the synthetic peptides used here was designed with a norleucine substitution of methionines to prevent MetSO formation. This might have altered recognition of certain sites by specific CPKs. For example, while CPK34 failed to phosphorylate a site in a peptide containing two norleucine substitutions for its methionines (GmSAT S367 ach-376), it showed strong phosphorylation of the same sequence synthesized with both methionines left unsubstituted (ach-375; Table 1). This failure of CPK34 to recognize the norleucine-substituted peptide is in contrast to CPK1, which retained strong phosphorylation of both peptides. Further analysis will be required to determine whether various CPKs will show isoform-specific differences in either increased or decreased substrate recognition as a result of MetSO-site modification.

cPK consensus sItes?
The use of consensus sites to predict kinase phosphorylation sites is sometimes useful, but often inaccurate (Que et al., 2010). Here we provide evidence for CPKs that predictions are usually wrong when based on consensus sites alone. Nearly all of the 534 peptides designed and tested in this study contained a simple-1 CPK consensus site (basic-X-X-S/T). Surprisingly, more than 80% of these peptides failed to show significant phosphorylation. While we cannot rule out that some peptides failed to be phosphorylated because of poor solubility or lack of appropriate secondary structure, analysis of selected peptides with alanine substitutions corroborate that recognition of a simple-1 consensus site can be strongly influenced by flanking residues (e.g., Tables 2 and 3), as previously indicated (Huang and Huber, 2001;Hernandez Sebastia et al., 2004).
The differences in substrate specificity found here provide a partial explanation for the origins of more than seven different consensus sites for CPKs (Harper and Harmon, 2005). The It is also important to emphasize that the kinase assays here were based on recombinant proteins with N-and C-terminal modifications. Although each recombinant kinase had a similar fusion protein design, their respective enzyme kinetics might differ from their native enzyme counterparts under in vivo conditions. bIologIcal relevance of substrate sPecIfIcIty dIfferences The isoform-specific functions of CPKs are potentially influenced by features that control their subcellular location, protein-complex interactions, lipid and Ca 2+ activation kinetics, and substrate specificity. This study was aimed at evaluating the potential influence of substrate specificity.
The extent to which CPKs in Arabidopsis are functionally interchangeable has yet to be determined. However, a consideration of pollen-expressed CPKs provides examples of both redundant and non-redundant activities (Yoon et al., 2006;Myers et al., 2009). For example, a double disruption of Arabidopsis CPK17 and 34 results in a nearly sterile phenotype with pollen tubes showing deficiencies in growth rate, growth potential, and tropism toward ovules (Myers et al., 2009). Because single disruptions in either CPK17 or 34 alone did not show any detectable phenotypes, this pair of CPKs provides an example of functional redundancy. Nevertheless, expression in pollen of 10 other CPKs is not able to compensate for the double knockout of CPK17 and 34. Specifically, one such exogenous kinase, CPK16, is also highly expressed in pollen and is targeted to a similar plasma membrane location (Dammann et al., 2003) as CPK17 and 34 (Myers et al., 2009), but is not able to restore the double loss of function.
While multiple features are expected to make different CPKs functionally distinct, results here suggest that substrate specificity differences could make a significant contribution. In vivo support for this expectation comes from a recent study on NtCDPK1 in which in vitro phosphorylation of a substrate (transcription factor RSG) was shown to be dependent on the variable N-terminal domain of the kinase (Ito et al., 2010). When the N-terminal domain of NtCDPK1 was substituted for the analogous region in a different CPK (AtCPK9), the chimeric kinase displayed in vitro and in vivo activities similar to that of NtCDPK1.

dI19-2, a cPK16-sPecIfIc substrate
Of the candidate phosphorylation sites identified here for CPK16, the S109 site in Di19-2 was the most extensively analyzed. Although this site overlaps with both simple-1 and simple-2 consensus sites for CPKs (Harper and Harmon, 2005), it was not recognized by three other isoforms, CPK1, 10, and 34 -emphasizing that features beyond a simple consensus site are clearly important for substrate recognition by different CPKs.
Di19-2 belongs to a family of seven related proteins in Arabidopsis, structurally defined by the presence of two atypical zinc finger domains. The Di19 name derives from a founding member whose mRNA expression was observed to be drought-induced. While the biological function of Di19-2 is not known, evidence indicates that its mRNA expression can be induced by a salt stress (but not drought stress; Rodriguez Milla et al., 2006). Public domain expression profiling results indicate that it is expressed in all major plant tissue types, including pollen, suggesting a function of potential importance to a subset of stress tolerance pathways in most cell types. Di19-2 is thought to be present in both nuclear and cytosolic materIals and methods
G-GFP-ST (ps627) is a parental vector used for substrate clones described below. It is modified from a pGEX backbone that encodes a N-terminal GST affinity tag, and includes a GFP, a polylinker with AscI and NotI cloning sites, and a C-terminal Strep-II affinity tag.
G-GFP-S109-ST (ps1160) encodes a short sequence of Di19-2 surrounding the S109 phosphorylation site (the amino acid sequence for the S109 site shown in Figure 2). This construct was modified from ps627 by insertion of a DNA fragment from Di19-2 into the AscI/NotI site. Using ps1160, variations were made to produce the mutations shown in Figure 4.
G-GFP-NR-ST (ps699) is identical to ps1160 except that it has an insert that replaces the Di19-2 S109 site (sequence between AscI and NotI) with a sequence from a NR (NR site shown in Figure 2). G-RFP-NR-ST (ps630) was used in some experiments to provide an alternative RFP-tagged (in place of GFP) version of the same NR substrate sequence encoded in ps699.

recombInant ProteIn exPressIon and PurIfIcatIon
All kinase fusion proteins were purified as affinity sandwiches using a 6x His tag at the C-terminal end and a GST tag at the N-terminal end . For substrate fusion proteins encoded by G-Di19-2 S109 (including mutants) and G-GFP-NR-ST, each protein was affinity-purified by a GST tag at the N-terminal end, followed by a Strep-tag-2 (IBA Biotechnology) at the C-terminal end. The fusion protein encoded by G-Di19-2 (ps1195) was purified using a GST affinity tag as described (Rodriguez et al., 2006). Purified proteins were stored frozen at −20°C in 100 mM Tris-HCl, pH 8.5, 0.1 mM EDTA, 10 mM glutathione, and 50% glycerol. The concentration of each sample was determined via the Bradford method (Bradford, 1976), and purity evaluated by separation using SDS-PAGE (10% polyacrylamide gel) and staining with Coomassie Brilliant Blue. All kinase fusions were estimated to be greater than 90% pure. development of a single meaningful consensus site for all CPKs does not appear possible, given that 4 of the 12 subgroups tested showed clear differences in substrate specificity. Thus, the evolution of bioinformatic tools for predicting potential CPK phosphorylation sites will need to incorporate a range of isoform-specific differences.
the frontIer: lInKIng thousands of KInases to theIr targets and bIologIcal functIons CPKs have been proposed as multifunctional kinases, translating Ca 2+ signals into the phospho-regulation of many different substrates (Harper et al., 2004;Klimecka and Muszynska, 2007). This paradigm is strongly supported here by the identification of 103 different substrates for four different CPKs (102 unique synthetic peptides, and residue S109 in Di19-2; Table 1 and Figure 1).
In this study, 274 peptides were designed, each with an in vivomapped phosphorylation site that matched a predicted CPK consensus site. Of these, 74 (27%) were found to be phosphorylated by one of the four CPKs tested. Because CPKs have been shown to be present in most cellular locations (Dammann et al., 2003), a corroboration between in vivo and in vitro phosphorylation provides a reasonable basis for postulating that a particular in vivo phosphorylation event could have been mediated by a CPK. For example, phospho-sites identified in Table 1 support specific hypotheses implicating CPK signaling pathways in regulating diverse aspects of cellular physiology, including auxin efflux (AT3G54820), ethylene response (AT4G23640), mechanosensitive ion channels (AT1G23080), and vesicle trafficking (AT1G10290).
In our study using 274 in vivo-mapped sites that contained a simple-1 motif (excluding reference peptides), we observed a 27% success rate in finding a CPK that recognized the same peptide in an in vitro kinase reaction. By contrast, we observed only a 6% success rate for peptides that were designed without any direct support from in vivo mapping data. However, given a 27% success rate as a precedent, it is reasonable to project that more than 200 additional CPK substrates might be confirmed through the application of this strategy to the nearly 8000 mapped phosphosites listed in PhosPhAt (Heazlewood et al., 2008). Because the list of in vivo-mapped phosphorylation sites is certain to expand, we project that more than 300 phosphorylation sites will soon emerge as potential points of Ca 2+ -dependent phospho-regulation.
The expanding list of CPK substrates is now shifting the scientific frontier from a focus on simply identifying potential CPK phosphorylation sites, to the larger question of understanding the biological functions associated with hundreds of Ca 2+ -triggered changes in phosphorylation. 10, 20, 50, 80, 100, or 200 μM) in a kinase reaction buffer (above). A typical reaction volume was 15 μL. The reaction was allowed to proceed at RT for 12 min and was stopped by spotting an 8-μL aliquot onto P81 phosphocellulose chromatography paper (GE Healthcare Biosciences, Piscataway, NJ, USA). Filters were immediately immersed in 75 mM phosphoric acid (PA), washed 4 × 5 min in 150 mL 75 mM PA, immersed in 95% EtOH for 1 min and airdried. Dry filters were submerged in Ecoscint (Fisher, Pittsburgh, PA, USA) and 33 P counts were determined by a scintillation counter. To calculate activities in nmol P incorporated/min/mg enzyme, the total amount of 33 P label in reactions was determined by counting a 5-μL aliquot of a sample reaction that was dried on filter paper and left unwashed. A Lineweaver-Burk method was used to determine K M s. Results shown are from two independent protein purifications for each CPK, with each preparation assayed at least twice.

PhosPho-sIte maPPIng
Phosphorylation site mapping was performed as previously described (Rodriguez Milla et al., 2006). Briefly, phosphorylated protein was dried at room temperature in a speed-vac, resuspended in a trypsin solution containing 0.4 μg trypsin (Promega, Madison, WI, USA) and 1 mM DTT in 25 mM ammonium bicarbonate, pH 8.0, and digested for 16 h at 37°. The digested peptides were dried and then redissolved in 0.1% (v/v) trifluoroacetic acid (TFA). Peptides were desalted using a reverse phase Zip-Tip C18 column (Millipore, Billerica, MA, USA) pre-equilibrated with 100% (v/v) acetonitrile (ACN) and washed three times with 0.1% (v/v) TFA. Peptides were loaded on the Zip-Tip by five cycles of aspiration and dispensing. The C18 column was washed with 0.1% (v/v) TFA three times and peptides were eluted using a solution of 50% (v/v) ACN and 0.1% (v/v) TFA. The eluate was dried with a speed-vac.
Peptides were analyzed by a neutral loss mass spectrometry method in the UNR Proteomics Core Facility (Schroeder et al., 2004). A reverse phase C18 column (Michrom) was used to separate peptides. The program was set to perform MS/MS/MS whenever it detected neutral loss of 32.6, 38.6, 49, 58, 65, and 98 D. The m/z values of the precursor, y and b ions were retrieved after the scans. Database searching was performed to identify the phosphorylated peptides. For Ion-Trap MS/MS data, m/z values of the precursor, y and b ions were searched against the NCBI database by use of the TURBOSEQUEST algorithm (Thermo Scientific) or MASCOT algorithm 2 (Perkins et al., 1999). All MS/MS spectrum were manually inspected to assure the accuracy of the identification.

acKnowledgments
This work was supported by grants to Jeffrey F. Harper from NSF (MCB-0920624, MCB-0114769, DBI-0420033, and DBI-0436450 for peptide predictions and synthesis), and NIH 1RO1 GM070813-01 (for analysis of Di19-2 sites), and to Ing-Feng Chang from National Science Council, Taiwan (NSC96-3111-B-002-001 and NSC97-2311-B-002-005-MY3), and to Alice C. Harmon from NSF MCB-0114769 for peptide analysis. Mass Spectrometry and Bioinformatics were made possible by the INBRE Program of the National Center for Research Resources (NIH grant P20 RR-016464).

synthetIc PePtIde desIgn and ProductIon
Non-biotinylated syntide-2 was purchased from Bachem (Torrance, CA, USA). Other synthetic peptides were made by Sigma Genosys (St. Louis, MO, USA) with purities estimated at 50%. Peptides used in SPA assays were all made with an N-terminal biotin group. Unless specifically noted, a norleucine was substituted for every methionine. Peptides were designed based on sequences known or predicted to be CPK substrates, as described in the results section.

KInase actIvIty gel-based assays
In vitro kinase assays were performed and modified as previously described (Rodriguez Milla et al., 2006) and labeled proteins separated by SDS-PAGE. Briefly, 50 μM ATP (spiked with 2.5 μCi gamma-32 P (3000 Ci/mmol, MP Biomedicals, Solon, OH, USA) was added to start the kinase reaction (final volume 10 μL) which consisted of 1 ng/μL purified CPK, 100 ng/μL fusion protein substrate, and standard kinase reaction buffer (20 mM Tris-HCl pH 7.5, 10 mM MgCl 2 , 1 mM EGTA, 1.1 mM CaCl 2 ). The reactions were incubated for 10 min at RT and stopped by adding 10 μL 2X SDS sample buffer. Proteins were separated by 12% SDS-PAGE and the gel was air-dried between acetate sheets. The dried gel was processed using either a phosphorimager (Cyclone Storage Phosphor System, PerkinElmer, Boston, MA, USA) or exposure to X-ray film (GE Healthcare Biosciences, Piscataway, NJ, USA). Images were processed using Quantity One (Bio-Rad) to determine the relative intensity of labeling. Signals were normalized to the amount of protein as determined from Coomassie Brilliant Bluestained gels after SDS-PAGE.

PhosPhorylatIon of synthetIc PePtIdes usIng scIntIllatIon ProxImIty assay
Synthetic peptides with N-terminal biotin tags were resuspended in 50% acetonitrile to a final stock concentration of approximately 5 mM. Kinase reaction conditions consisted of 20 μM peptide (or ddH 2 O as a control), 10 μM ATP, 0.0625 μCi 33 P[γ-ATP], 25 ng purified CPK, and 0.1 mg/mL BSA (Bovine Serum Albumin) in standard kinase buffer (10 mM Tris-HCl, pH 7.5, 1 mM MgCl 2 , 1.1 mM CaCl 2 , 1 mM EGTA) in a final reaction volume of 25 μL. Reactions were started by addition of ATP. After 24 min incubation at RT, 10 μL aliquots were diluted with 30 μL dilution buffer (0.13% SDS, 13.3 mM EDTA) and 10 μL of this dilution was added to 200 μL pre-aliquoted stop solution (500 μM ATP, 1% Triton x-100, 1×PBS pH 11.0, and 100 μg SPA beads [streptavidin-coated, yttrium silicate (GE Healthcare)] in 96-well plates. Counts per minute (CPM) for each stopped reaction were determined by a microbeta scintillation counter (PerkinElmer) with a 5-750 window setting, as per manufacturer's recommendations. Those peptide reactions showing at least fourfold greater CPM than a no-peptide control (in an average of two to four independent reactions) were considered to represent significant levels of phosphorylation.

standard fIlter-based assay for KInase actIvIty
Kinetic assays were performed essentially as described . Briefly, kinase reactions were started by adding ATP [final concentration of 50 μM ATP plus 0.375 μCi 33 P[γ-ATP] (3000 Ci/ mmol)] to a reaction that consisted of 1 ng/μL CPK-fusion protein, 0.1 mg/mL BSA, and varied peptide substrate concentrations (0, 5, 2 http://www.matrixscience.com Table S1 | Peptides tested in SPA assays. Each peptide was made with norleucine substitutions for methionines (except ach-375), and a biotin tag on the N-terminal end.