- 1College of Agronomy, Qingdao Agricultural University, Qingdao, China
- 2Tobacco Research Institute, Chinese Academy of Agricultural Sciences, Qingdao, China
- 3Qingdao Municipal Key Laboratory of Plant Molecular Pharming, Tobacco Research Institute, Chinese Academy of Agricultural Sciences, Qingdao, China
- 4Vegetable and Landscape Gardening Institute, Heze Academy of Agricultural Sciences, Heze, China
Introduction: Papain-like cysteine proteases (PLCPs), characterized by a conserved cysteine residue at their active sites, play crucial roles in plant growth, development, and responses to biotic and abiotic stresses. While their importance in plants is well recognized, the characterization of the PLCP gene family in Nicotiana benthamiana (NbPLCP) and its involvement in biotic stress responses remains insufficiently studied. This study aims to identify NbPLCP genes in N. benthamiana and elucidate their roles in facilitating heterologous protein expression.
Methods: A comprehensive bioinformatics approach was employed to identify NbPLCP genes using N. benthamiana genomic data and homology with Arabidopsis thaliana. The analysis included examination of physicochemical properties, phylogenetic relationships, gene structures, conserved motifs, expression profiles, gene duplication events, and chromosomal distribution. Virus-induced gene silencing (VIGS) was utilized to screen for genes affecting heterologous protein expression in N. benthamiana.
Results: We identified and characterized 50 NbPLCP members. Phylogenetic analysis classified these NbPLCPs into nine subfamilies, with uneven distribution across 19 chromosomes. Comparative analysis revealed closer evolutionary relationships between N. benthamiana and Solanum lycopersicum, followed by Arabidopsis thaliana, while showing more distant relationships with Oryza sativa. Functional studies demonstrated that NbPLCPs likely participate in regulating plant growth, development, and biotic stress responses. Importantly, VIGS-mediated silencing of NbXCP1, NbXCP2, and NbXCP3 significantly enhanced the expression of heterologous GFP protein.
Discussion: This study provides comprehensive insights into the PLCP family in N. benthamiana, highlighting their functional significance and potential in heterologous protein expression. Our findings establish a foundation for understanding the evolution and function of NbPLCPs, while demonstrating their potential applications in plant biotechnology for enhancing disease resistance and improving recombinant protein production systems. These results underscore the importance of PLCPs in both plant physiology and biotechnological applications.
1 Introduction
Proteases degrade substrate proteins by hydrolyzing peptide bonds. Based on the chemical nature of their catalytic sites, they can be classified into four categories: cysteine proteases, serine proteases, aspartic proteases, and metallo proteases (Zhang et al., 2023). Among them, cysteine proteases (CPs) rely on the cysteine residue at the active site as a nucleophilic group to catalyze reactions, a key subclass, papain-like cysteine proteases (PLCPs), is characterized by a conserved catalytic triad (Cys-His-Asn) at its core (Cen et al., 2019), by efficiently catalyzing reactions, they regulate plant growth and development, stress responses, and disease resistance processes (Liu et al., 2018; Li et al., 2025; Misas-Villamil et al., 2016; Rawlings et al., 2016). PLCPs are synthesized in the form of zymogens, with their precursors containing an N-terminal signal peptide, an autoinhibitory pro-domain, and a mature 25–35 kDa active protease domain. Some PLCPs also possess an unknown functional particle structure at the C-terminal (Zhang et al., 2023). These subfamilies include the CTB, ALP, RD19, SAG12, RD21, CEP, THI, XBCP, and XCP subfamilies (Kordis and Turk, 2009; Richau et al., 2012), Each subfamily achieves functional differentiation through specific structural domains, such as inhibitor-binding domains or substrate recognition regions (Long et al., 2025).
The activity of PLCPs is notably increased during the development and germination of seeds (Martinez et al., 2009), as well as in fruits (Lin et al., 1993) and various plant organs (van Wyk et al., 2014). In Arabidopsis thaliana, the gene AtSAG12 exhibits a tightly regulated expression pattern associated with leaf senescence, and it has become a widely used molecular marker for studying leaf aging. Moreover, AtRD21 and AtRD19 serve as early-response marker genes for dehydration stress under drought and salt conditions, reflecting the role of PLCPs in the plant’s response to abiotic stresses (Guo and Gan, 2014; Lohman et al., 1994). In Oryza sativa, PLCPs assist in pathogen defense by broadly activating signaling pathways and transcription factors that coordinate downstream responses. This includes the upregulation of various disease-related proteins and the biosynthesis of secondary metabolites, which contribute to resistance mechanisms (Nino et al., 2020). As a critical component of the proteolytic machinery, PLCPs are responsible for the degradation of intracellular proteins, playing an essential role in regulating programmed cell death (PCD). PCD is a highly regulated biological process that is integral to numerous aspects of plant development, as well as responses to both biotic and abiotic stressors (Liu et al., 2018). In the context of Medicago truncatula, MtCP6 is induced during both developmental and stress-induced nodule senescence. Its early expression promotes nodule senescence, while MtCP77 positively regulates root nodule aging by accelerating plant PCD and reactive oxygen species (ROS) accumulation (Deng et al., 2019; Pierre et al., 2014). Furthermore, PLCPs are closely associated with plant resistance to herbivory. For instance, papain, a well-known PLCP found in the latex exudate of papaya, plays a critical role in defending the plant against herbivorous insects, such as lepidopteran larvae (Konno et al., 2004). This highlights the multifaceted role of PLCPs in both plant development and stress response, emphasizing their importance in the regulation of cellular processes and interactions with environmental factors. Although the functional roles of PLCPs have been extensively investigated in other species, research on this family in N. benthamiana remains unexplored.
Tobacco (Nicotiana benthamiana), a model plant, is widely utilized in the study of plant innate immunity and defense signaling pathways. It is particularly suitable for virus-induced gene silencing and transient gene overexpression through Agrobacterium-mediated infiltration (Bally et al., 2018). In recent years, N. benthamiana has gained attention as a biological reactor for the production of vaccines and therapeutic proteins (Douglas, 2018). However, the expression of exogenous proteins in N. benthamiana can be influenced by a variety of factors. One such factor is the interference by endogenous proteases in N. benthamiana, which can reduce the accumulation of exogenous proteins (Jutras et al., 2020). Among the various protease inhibitors identified, the co-expression of four specific inhibitors, namely SICYS8, NbPR4, NbPot1, and human HsTIMP, has been shown to significantly enhance the expression levels of exogenous proteins when compared to controls (Grosse-Holz et al., 2018).The SICYS8 protease inhibitor targets papain-like cysteine proteases. Strong inhibition of nine distinct PLCPs in N. benthamiana leaves has been observed upon the expression of SICYS8, suggesting a direct and potent effect on PLCP activity (Jutras et al., 2019). Although several members of the PLCP family have been extensively studied in N. benthamiana, a comprehensive investigation into the phylogeny of the entire PLCP gene family has yet to be undertaken.
In this study, we utilized the latest genomic data to systematically identify and analyze the PLCP gene family in N. benthamiana. Our comprehensive analysis included the construction of a phylogenetic tree, investigation of gene structures, identification of cis-regulatory elements, synteny analysis, expression profiling, and assessment of how specific PLCP family members influence the expression of the exogenous green fluorescent protein (GFP). These findings provide valuable insights into the evolutionary dynamics and functional roles of the PLCP gene family in N. benthamiana. Furthermore, this research contributes to a deeper understanding of N. benthamiana as a plant-based bioreactor, with potential applications in optimizing protein production within this species.
2 Materials and methods
2.1 Plant materials and growth conditions
The experimental materials used in this study were the N. benthamiana seeds kept by the Chinese Academy of Agricultural Sciences, and the designated tobacco seeds were sown on the soil matrix. After germination, the seedlings were transplanted to 25°C, 16 h/8 h (light/dark) photoperiod conditions to 6 to 8 leaf stage for Agrobacterium infiltration and inoculation.Samples were taken at 0h, 3 h, 12 h, 24 h, 36 h, 48 h and 72 h, and stored at -80°C after snap freezing in liquid nitrogen.
2.2 Identification and sequence analysis of the NbPLCP gene family
In order to search the members of the PLCP gene family, we downloaded the protein sequences of all the Arabidopsis PLCP family members from the TAIR (www.arabidopsis.org) Arabidopsis database, and analyzed these PLCP family members from the Pfam (http://pfam.xfam.org/) database, which showed that they generally possess the conserved domain of PF00112. Also we downloaded the genome and protein files of the NbeHZ1 version from the Nicomics (http://lifenglab.hzau.edu.cn/Nicomics/index.php) website. The hidden horse model file of PLCP gene family was found and downloaded through Pfam (http://pfam.xfam.org/) website using the hmmsearch program to set the E value of 1e-20 threshold; and the protein sequence of AtPLCPs family was Query to search the possible NbPLCPs sequence through BLAST search. The protein sequences obtained from both methods were removed after redundancy and uploaded to CDD (https://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi) and SMART (http://smart.embl.de/smart/set_mode.cgi?NORMAL=1) for domain validation to remove protein sequences without the PF000112 domain (Finn et al., 2014; Schultz et al., 1998).
By visiting the Expasy online platform (http://www.expasy.org/tools/protparam.html) and using its ProtParam tool, various physiochemical properties of NbPLCP protein members, including protein length, molecular weight, isoelectric point, aliphatic amino acid content and hydrophobicity index. The NbPLCP was named based on its chromosomal location (Gasteiger et al., 2003).
2.3 The evolutionary tree analysis of the NbPLCPs gene family members
The PLCP protein sequences of Arabidopsis thaliana, rice (Oryza sativa) and tomato (Solanum lycopersicum) can be downloaded from the Uniprot database (https://www.uniprot.org). A phylogenetic tree of these species and the PLCP proteins was constructed using MEGA11. Sequence ratios were performed using the ClustalW method, excluding non-conserved regions outside the aligned domains. A phylogenetic tree was constructed using a maximum likelihood method with a bootstrap value of 1000 (Kumar et al., 2018).
2.4 Analysis of gene structure, protein domains, and conserved motifs
Online tool MEME (http://meme-suite.org/) was used to predict conserved motifs in NbPLCP proteins, with motif lengths set to 6–100 with up to 10 motifs identified while maintaining default values for other parameters (Bailey et al., 2015). Using Batch CD-Search (https://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi) for the NbPLCP family, And the visualization of their protein-conserved domains using TBtools, Gene structure analysis based on the genomic DNA and CDS sequences of the NbPLCP gene family members, And using the online gene structure visualization server 2.0 (https://gsds.gao-lab.org/Gsds_help.php) for gene structure visualization.
2.5 Identification of cis-acting regulatory elements in the PLCP gene
From the 2000 bp region from the genome sequence file, the cis-regulatory elements (CAREs) of these promoter regions were then analyzed using PlantCARE software (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) (Lescot et al., 2002). Finally, tbtools presents all the identified elements of the promoter region of the NbPLCP gene as a heatmap.
2.6 Chromosomal localization and collinearity analysis
Information on the start and stop positions of PLCPs genes was extracted from the NbeHZ1 genome Gff3 file in TBtools software and mapped on the corresponding chromosome. For NbPLCPs analysis within species, TBtools software Fasta Tools and Blast tools, including chromosome length, gene location files and corresponding alignment files were used (Chen et al., 2020). PLCPs were analyzed using the One step MCScan X tool, and E value was set to 1e-5. The resulting Collinearity file was used for fragment duplication gene analysis and Tandem file was used for tandem repeat gene analysis (Wang et al., 2012). Use the KaKs_Calculator 3.0 software to calculate the non-synonymous to synonymous substitution ratio (Ka/Ks) of homologous gene pairs within the NbPLCP gene family (Zhang, 2022).
2.7 Experimental processing and sampling
Synthesize the nucleic acid sequence of SICYS8 (Sainsbury et al., 2013) and construct the pEAQ::SCYS8 overexpression vector using T4 ligation technology. Inject pEAQ::SCYS8 and TRBO::GFP into one side of the leaves of 4-week-old Nicotiana benthamiana, and co-inject empty pEAQ vector and TRBO::GFP into the other side of the leaves as a control group. Take photos and collect samples 3 days later, extract total protein from the samples, and use it for subsequent Western Blot analysis to detect the expression level of green fluorescent protein (GFP).
Design transient silencing fragments of the NbRD21, NbCTB, NbRD19, NbXBCP, and NbCEP subfamilies using the VIGS TOOL, Based on the conservative sequences of the same subfamily members, several vectors were designed to simultaneously silence multiple genes the primers used are shown in Supplementary Table S1. The specific vectors include: TRV2::NbRD21BD (silencing NbRD21B and NbRD21D), TRV2::NbRD21E (silencing NbRD21E), TRV2::NbXBCP17 (silencing NbXBCP1 and NbXBCP7), TRV2::NbXBCP345 (silencing NbXBCP3, NbXBCP4, and NbXBCP5), TRV2::NbXBCP26 (silencing NbXBCP2 and NbXBCP6), TRV2::NbRD19ABC (silencing NbRD19A, NbRD19B, and NbRD19C), TRV2::NbRD19E (silencing NbRD19E), TRV2::NbCTB13 (silencing NbCTB1 and NbCTB3), TRV2::NbCTB2 (silencing NbCTB2), and TRV2::NbXCP123 (silencing NbXCP1, NbXCP2, and NbXCP3).using homologous recombination technology. Construct the TRV2::GUS vector as a control. Co-inject Agrobacterium containing TRV2 experimental group with TRV1 into the leaves of 2-week-old Nicotiana benthamiana as the experimental treatment, and co-inject TRV2::GUS with TRV1 as the control. Fifteen days later, inject TRBO::GFP Agrobacterium into the T and CK groups, take photos, and collect samples 3 days later. Extract RNA and total protein from the samples for subsequent quantitative fluorescence analysis and Western Blot analysis to detect the expression level of green fluorescent protein (GFP).
2.8 Protein extraction and Western Blot detection
Nicotiana benthamiana leaves were ground in liquid nitrogen using a mortar and pestle, followed by the addition of Tris-HCl buffer (pH 8.0), 2.5 M NaCl (pH 8.0), EDTA (pH 8.0), glycerol, 100 mM PMSF, and ddH2O. The samples were incubated on ice for 30 minutes and then centrifuged at 12,000 rpm for 10 minutes at 4°C. The supernatant was collected and mixed with 5× SDS-PAGE protein loading buffer (Solarbio P1040) at an appropriate ratio. The mixture was heated at 95°C for 10 minutes in a metal bath. Proteins were separated by 12.5% SDS-PAGE gel electrophoresis and transferred to a PVDF membrane using the Trans-Blot Turbo system (Bio-Rad, Hercules, CA). The membrane was blocked in TBS-T containing 5% non-fat milk and incubated with Anti-GFP antibody (1:8000; Abcam, ab6556) and goat anti-rabbit IgG H&L (HRP) secondary antibody (1:4000; Abcam, ab6721).
2.9 RNA extraction and quantitative real-time PCR analysis
Whole RNA of N. benthamiana from control and processing group extracted with an RNA extraction kit (Beijing Kangwei Century Company, CW0581M). The cDNA was reverse transcription kit (Nanjing Norzan Biotechnology Co., Ltd., R333-01). For real-time PCR (Real-time PCR) detection, 10-fold diluted cDNA was used as the reaction template and a set of specific PCR reaction process: at the starting stage, 3 minutes pre-denaturation at 95°C; then 40 PCR cycles with 10 seconds 95°C denaturation phase and 30 seconds 60°C extension phase; and the flow parameters were 15 seconds 95°C treatment, 60 seconds 60°C treatment, and finally 15 seconds 95°C treatment. Data were calculated and analyzed using the 2-ΔΔCT method (Rao et al., 2013), with NbActin as the reference gene. The results were compared and integrated with other data. Significance analysis was performed using the ANOVA method with a significance level set at 0.05. The specific primers applied in this trial are detailed in Supplementary Table S2, and three experimental replicates were performed for each sample.
3 Results
3.1 Identification and annotation of NbPLCPs
Using the HMMER search technique, a total of 50 PLCP family genes were successfully identified (Table 1). The corresponding proteins were analyzed using protein analysis tools to determine their physicochemical properties. The protein lengths ranged from 156 to 925 amino acids, with molecular weights ranging from 17,147.41 Da to 104,611.01 Da. The theoretical isoelectric points (pI) varied significantly, with NbRD19G showing the highest pI and NbTHI2 showing the lowest. NbXBCP6 exhibited the highest instability index, while NbRD19C had the lowest aliphatic index. These parameters enhanced our understanding of the characteristics of the NbPLCP family, providing deeper insights into their structure and physicochemical properties. Additionally, the subcellular localization of the proteins encoded by NbPLCP was predicted, and the results indicated that most family members are localized to vacuoles and the endoplasmic reticulum. Specifically, NbXBCP6, NbRD19C, and NbRD19F were predicted to be localized to the nucleus, NbRD19H was predicted to be localized to the Golgi apparatus, and NbPAP3 was predicted to be localized to the membrane, while NbCTB1 was predicted to be localized to both vacuoles and chloroplasts.
3.2 Phyloevolutionary analysis of the NbPLCP families
To further investigate the evolutionary relationships of NbPLCP, we constructed a phylogenetic tree using PLCP protein sequences from different species (Figure 1). Based on previous studies, PLCPs were classified into nine subfamilies: CTB, ALP, RD19, SAG12, CEP, THI, XBCP, XCP, and RD21. The NbPLCP family members are distributed across these nine subfamilies. The ALP subfamily contains the fewest NbPLCP members, with only two members; the CTB, THI, and XCP subfamilies each contain three members; the CEP subfamily has six members; the RD21 and XBCP subfamilies each have seven members; and the RD19 subfamily contains eight members. The SAG12 subfamily has the largest number of NbPLCP members, with eleven. Evolutionary analysis across different species indicates that the evolutionary mechanisms of the PLCP gene family are relatively conserved.

Figure 1. Evolutionary relationships of members of the NbPLCP family with Arabidopsis, tomato and rice.
3.3 Conserved motifs, protein structure, and gene structure analysis of NbPLCPs
We analyzed the gene structure of NbPLCPs, and showed that these genes contained more than one exon, ranging from 1 to 20 (Figure 2), number of the same exons for the same members of the same subfamily.Such as the number of exons in the XCP subfamily, the XBCP subfamily, and the CEP subfamily. The member with the highest number of exons, NbCTB1, located in the CTB subfamily, has 20 exons. Although other subfamily members do not have uniform numbers of exons, they possess very close numbers of exons.

Figure 2. Structure and motif analysis of the NbPLCPs gene. (A) Phylogenetic tree of NbPLCPs constructed using MEGA. (B) A map of the exon structures displayed using the GSDS web site. (C) Analysis of 10 Motifs in the meme website construction. (D) Analysis of the conserved motifs.
To further understand the structural and functional characteristics of NbPLCPs family members, 10 conserved motifs in NbPLCPs proteins were identified using MEME software (Figure 2), with only one Motif4 in NbRD19F and all members except NbRD19G without Motif4 having Motif4. Where Motif1 is present in other members except the four members, NbRD19F, NbTHI1, NbTHI3 and NbSAG12G. Combined with the phylogenetic tree, we shows that the protein structure of the members on the same branch is basically consistent, suggesting that the members on the same branch may continue their function. In summary, the analysis of conserved motifs, protein domains, and gene structure of gene family members provides strong evidence for the results of the phylogenetic analysis.
3.4 Gene duplication analysis and chromosomal distribution of NbPLCPs
In order to better understand the distribution of the NbPLCP gene family members across the genome, based on the current N. benthamiana genome, 50 NbPLCP genes are unevenly distributed across 17 of the 19 chromosomes (Figure 3). Chromosome 18 harbors the highest number of NbPLCP genes (10 genes), while chromosomes 2, 8, and 9 contain only one NbPLCP gene each. Two NbPLCP genes are located on chromosomes 1, 4, 5, 6, 14, 15, and 19 respectively, whereas three NbPLCP genes are found on chromosomes 12, 13, and 16. Chromosomes 7 and 17 each possess four NbPLCP genes, and chromosome 3 accommodates six NbPLCP genes. Subsequent analyses identified 22 segmental duplication events among 23 NbPLCP genes, with the NbXBCP5/NbTHI3 segmental duplication occurring across two subfamilies, while the remaining 21 segmental duplications occurred within the same subfamily. Notably, no tandem duplications were detected, suggesting that segmental duplications play a pivotal role in the evolution of the NbPLCP gene family. Additionally, the selection pressure analysis of the NbPLCP genes (Table 2) revealed that the Ka/Ks value for the NbXBCP5/NbTHI3 gene pair (0.543), although less than 1, is greater than 0.5, indicating strong functional constraints on these genes during evolution, with certain sites undergoing adaptive changes.

Figure 3. (A) Chromosome positioning is based on the location of 17 chromosomes (Mb), the proportion on the left is trillion 1 billion (Mb), and the number of chromosomes is located on the left of each chromosome. (B) The inter-genomic collinearity of PLCP genes in N. benthamiana. The red line represents the segmental duplication event among PLCP family members.

Table 2. Ratios of nonsynonymous (Ka) and synonymous (Ks) of NbPLCP gene fragment duplication pairs in Nicotiana benthamiana.
3.5 Investigation of co-linear relationships between N. benthamiana and other species
In order to enhance our understanding of the evolutionary relationships within the NbPLCP gene family, we conducted a comparative analysis. This analysis examined the collinearity of NbPLCP genes in N. benthamiana with those from three other plant species (see Figure 4). The results indicated that there are 33, 15, and 2 NbPLCP genes exhibiting collinearity with the NbPLCP genes of tomato, Arabidopsis, and rice, respectively. Among these three species, the number of collinear gene pairs between N. benthamiana and each species was found to be 52, 23, and 2, respectively. Notably, the genes NbRD19A, NbRD21A, NbRDL1, NbSAG12A, NbRDL2, NbCEP1, and NbXCP2 in N. benthamiana displayed consistent chromosomal positions with their homologous genes in tomato, suggesting that these genes have maintained a similar genomic structure throughout evolution. In contrast, within the collinear gene pairs with Arabidopsis, only NbCEP1 exhibited consistent chromosomal positioning with its homologous gene in Arabidopsis. The differences in ecological adaptability between N. benthamiana and rice reflect the distinct selective pressures they have encountered during evolution, leading to diversity in genomic structure and function. The comparative genomic data between N. benthamiana, tomato, Arabidopsis, and rice reveal significant insights into their evolutionary relationships, gene homology, and functional conservation, providing a foundational understanding of plant genetic diversity and evolutionary history.

Figure 4. Synteny analysis between Nicotiana benthamiana and other species. Gray lines indicate all collinear relationships between different chromosomes, while colored lines represent collinear analysis specific to NbPLCP family genes. (A) Synteny analysis between N. benthamiana and Solanum lycopersicum. (B) Synteny analysis between N. benthamiana and Arabidopsis thaliana. (C) Synteny analysis between N. benthamiana and Oryza sativa.
3.6 Analysis of promoter cis-acting elements of NbPLCP gene
Research into NbPLCP’s potential involvement in stress responses necessitates a thorough analysis of the cis-acting elements within the promoter regions of these genes. This study aims to uncover various elements associated with abiotic and biotic stresses, plant hormones, and growth and development (Figure 5), offering significant insights into the regulatory mechanisms of NbPLCP in response to environmental stimuli. Under abiotic and biotic stresses, genes in the NbRD21 subfamily display diverse expression patterns under adverse conditions. Notably, NbRD21D possesses high binding capacities to promoter elements like DRE and MBS, highlighting its crucial role in drought and salt stresses. By modulating the expression of genes related to stress resistance, members of the NbRD21 subfamily enhance the plant’s adaptability to environmental pressures. NbCEP2 shows maximum enrichment under cold and salt stresses, indicating its significance in adversity response. Moreover, NbXCP1’s high enrichment under oxidative stress suggests its link to plant antioxidative capabilities, potentially enhancing stress resistance through the modulation of relevant signaling pathways. Within the NbSAG12 subfamily, increased enrichment of NbSAG12B on drought and high temperature regulatory elements suggests its key role in plant senescence and stress response. Additionally, changes in the enrichment of adversity cis-acting elements in members of the NbALP and NbCTB subfamilies also demonstrate their roles in plant growth and stress resistance. Particularly, the cold and saline-alkali stress elements of NbCTB2 underscore its critical role in plant stress resistance. Finally, variations in multiple adversity elements in members of the NbRD19 subfamily further emphasize the importance of promoter elements in regulating plant stress responses.

Figure 5. Analysis of promoter cis-acting elements in response to light signal, phytohormone, growth and development, and abiotic stress.
3.7 NbPLCP dynamic expression patterns in response to viral infection
To gain a better understanding of the role of the NbPLCP protease family in the N. benthamiana bioreactor, this study analyzed the dynamic expression patterns of NbPLCP in response to viral infection (Figure 6). Most members of this family responded to viral infection at varying degrees across different time points, with diverse time-dependent expression characteristics observed among different genes during the infection process, reflecting their potential roles in plant immune responses. Furthermore, some genes displayed stage-specific expression peaks, such as NbRD21C, which significantly increased after 12 hours of infection and then decreased, suggesting that it may have a key regulatory function at specific stages of infection. Similarly, the NbXCP and NbRD19 subfamily genes exhibited coordinated expression changes, indicating that they may be regulated by a common regulatory network, potentially participating in the regulation of the same biological pathway or engaging in crosstalk between different immune signaling pathways. The synchronous expression observed within gene clusters further suggests the involvement of potential regulatory factors and co-regulatory mechanisms in driving their responses. For instance, NbXCP1, NbXCP2, and NbXCP3 exhibited similar expression patterns, gradually increasing after infection, peaking at 48 hours, and then slightly declining thereafter. These findings highlight the intricate regulatory networks and diverse functional roles of NbPLCP family members in plant immunity.

Figure 6. Analysis of NbPLCP gene expression patterns under viral infection treatment based on RNA-seq data. The legend represents the expression levels of NbPLCP genes, ranging from high expression (red) to low expression (white).
3.8 The protease inhibitor SICYS8 enhances the expression of exogenous GFP protein
Previous studies have shown that the protease inhibitor SICYS8 inhibits the hydrolytic activity of the PLCP protease family, thereby increasing the expression of exogenous proteins in N. benthamiana. In this study, we transiently expressed GFP along with SICYS8 in N. benthamiana to investigate the impact of the NbPLCP protease inhibitor SICYS8 on the biological response of the N. benthamiana bioreactor. From the phenotype, it was observed that the green fluorescence intensity on the right side, where only GFP was expressed, was lower than on the left side, where both SICYS8 and GFP were co-expressed (Figure 7). Western blot analysis revealed that the GFP protein content was significantly higher in the samples co-expressing SICYS8 compared to the control group (Figure 7). This indicates that SICYS8 can enhance the expression of recombinant proteins in N. benthamiana by inhibiting the activity of proteinases.

Figure 7. (A) Agroinfiltration of Nicotiana benthamiana leaves, with 35s::GFP infiltrated on the left side and a 1:1 mixture of 35s::GFP and 35s::SICYS8 infiltrated on the right side. (B) Western blot analysis of the samples from panel B, showing the detection results of GFP protein (molecular weight: 27 kDa).
3.9 The effect of PLCP family members on the expression of exogenous GFP in Nicotiana benthamiana
As shown in Figure 6, following infection with exogenous viruses, changes in the expression levels of PLCP family members are primarily concentrated in the NbRD21, NbCTB, NbRD19, NbXBCP, and NbCEP subfamilies. Based on the phylogenetic relationships of these subfamily members illustrated in Figure 2, we constructed ten transient silencing vectors: TRV2::NbRD21BD, TRV2::NbRD21E, TRV2::NbXBCP17, TRV2::NbXBCP345, TRV2::NbXBCP26, TRV2::NbRD19ABC, TRV2::NbRD19E, TRV2::NbCTB13, TRV2::NbCTB2, and TRV2::NbXCP123. Using transient silencing technology, these genes were knocked down. Following the knockdown, the plants were infected with green fluorescent protein (GFP), and the GFP expression level was observed and measured.
The results of the comparison showed that GFP fluorescence intensity in NbXCP123-silenced plants was higher than that in the control plants (Figure 8). As shown in Figure 8, the expression levels of three genes, NbXCP1, NbXCP2, and NbXCP3, were analyzed in Nicotiana benthamiana leaves infected with either GFP or an empty vector control over a three-day period. Compared to the control group, significant changes were observed in the GFP-infected plants after 24 hours. Specifically, the expression of NbXCP1 and NbXCP3 increased overall and stabilized after 24 hours, while NbXCP2 showed an initial increase followed by a decrease. Figure 8 presents the results of a Western blot analysis of GFP expression in NbXCP123-silenced plants and control plants. The results indicate that GFP expression in the NbXCP123-silenced plants was higher than in the control plants. As shown in Figure 8, the three target genes, NbXCP1, NbXCP2, and NbXCP3, were significantly downregulated in NbXCP123-silenced plants compared to the control plants. These results demonstrate that the expression levels of XCP subfamily members in Nicotiana benthamiana were indeed altered after the infection with the exogenous GFP protein, showing a marked increase. Moreover, an obvious elevation in GFP expression was observed and detected in the transiently silenced plants.

Figure 8. (A) Phenotypic comparison of TRBO::GFP infection in control plants and NbXCP123-silenced plants. (B) The expression levels of NbXCP1, NbXCP2, and NbXCP3 in Nicotiana benthamiana infected with TRBO::00 and TRBO::GFP were analyzed at 0 h, 3 h, 12 h, 24 h, 36 h, 48 h, and 72 (h) (C) Western blot analysis of GFP expression in control plants and NbXCP123-silenced plants. (D) Expression levels of NbXCP1, NbXCP2, and NbXCP3 in control plants and NbXCP123-silenced plants.
4 Discussion
The PLCP (Papain-Like Cysteine Protease) family is one of the most abundant cysteine protease families in plants, participating in various life processes, including senescence, pollen development, fruit ripening, and seed germination (Liu et al., 2018). To date, systematic analyses of the PLCP family have been conducted in species such as Arabidopsis thaliana (Richau et al., 2012), rice (Nino et al., 2020), chili pepper (Chen et al., 2024), soybean (Yuan et al., 2020), grape (Kang et al., 2021), and papaya (Liu et al., 2018). In this study, we identified 50 members of the PLCP family in the genome of N. benthamiana. Phylogenetic analysis classified these genes into nine subfamilies, with members of the same subfamily showing similarities in gene structure and conserved motifs. The differences in amino acid length, molecular weight, isoelectric point, aliphatic amino acid ratio, hydrophobicity index, and chromosomal location of NbPLCP proteins suggest that these variations may be related to the functional diversity of these family members.
Gene structure analysis reveals significant differences in intron and exon compositions among the members of the N. benthamiana PLCP gene family. For example, members of the XCP, XBCP, and CEP subfamilies exhibit consistency in the number of exons, reflecting the conservation of these genes in specific biological functions. The lack of conserved motifs in some PLCP family members suggests potential functional diversification within the family, indicating that NbPLCP proteins may play diverse roles in various physiological processes. Members from the same evolutionary branch display highly consistent protein structures, pointing to functional conservation throughout evolution. By integrating conserved motif and gene structure analysis, the reliability of the phylogenetic tree is further validated, providing strong support for functional predictions of family members.
Additionally, 50 members of the N. benthamiana PLCP gene family are distributed across 17 chromosomes (Table 1, Figure 3). Among these, 22 pairs of segmental duplication genes were identified, involving 23 NbPLCP genes, while no tandem duplication gene pairs were detected. This contrasts with previous studies in other species, where tandem duplication gene pairs were more likely to occur than segmental duplications within the PLCP family members of grapevines (Kang et al., 2021). Similar findings were also observed in papaya (Liu et al., 2018). This suggests that segmental duplication plays a crucial role in the evolution of the NbPLCP gene family. Moreover, the Ka/Ks ratio for all gene pairs was less than 1, indicating that these gene pairs have undergone purifying selection to prevent the spread of harmful mutations. Comparative genomics analysis revealed 52 pairs of collinear genes between N. benthamiana and tomato, 23 pairs with Arabidopsis, and 2 pairs with rice. These results highlight the significant evolutionary relationship, gene homology, and functional conservation between N. benthamiana and these species, providing fundamental insights into plant genetic diversity and evolutionary history.
This study provides an in-depth analysis of the phylogeny, gene structure, and the impact of PLCP genes on the expression of exogenous green fluorescent protein (GFP), revealing the importance of PLCPs in plant physiological processes and their potential applications in bioreactors. Previous studies have shown that PLCP protease families are common targets for pathogen effectors (Misas-Villamil et al., 2016), such as XCP2 (Zhang et al., 2014) and CEP2 (Mueller et al., 2013). The VvRD21–1 gene plays a significant role in disease resistance in Vitis vinifera L (Kang et al., 2021). Similarly, in cotton, the orthologous gene GhRD21–7 of NbRD21D in N. benthamiana has been shown to enhance resistance to Verticillium dahliae in overexpression plants (Zhang et al., 2019). The results indicate that PLCP family members exhibit significant expression changes during plant development and stress responses. We found that genes such as NbTHI2 and NbCEP1 are rapidly activated in the early stages of infection, exhibiting characteristics typical of acute response mechanisms, suggesting their involvement in the initiation of defense responses and signal transduction during the early stages of viral invasion. In contrast, genes such as NbRD21D and NbALP2 maintain high expression levels throughout the infection process, suggesting they may play broader regulatory roles at multiple stages of the immune response. These findings suggest that silencing, knockout, or mutation of PLCP family members in plants makes them more susceptible to pathogen infection.
Through functional analysis of PLCP family members, we found that specific PLCP inhibitors (such as SICYS8) can significantly enhance the expression of exogenous GFP, indicating that the interference with endogenous proteases is a key factor affecting their accumulation. In the experiments, the GFP expression in plants with NbXCP123 silencing was significantly higher than in the control group, further validating the potential role of PLCPs in regulating exogenous protein expression. This finding provides a new strategy for optimizing N. benthamiana as a bioreactor, where the silencing or knockout of the genes NbXCP1, NbXCP2, and NbXCP3 can potentially enhance the yield of vaccines and therapeutic proteins. Furthermore, the phylogenetic and gene structure analysis of the PLCP family lays the foundation for understanding its functional diversity. Although some PLCP members have been studied previously, this research fills the gap in phylogenetic analysis, revealing the conservation and specificity of PLCPs in different plant species. This work paves the way for future comparative functional studies of PLCPs in multiple plant species.
5 Conclusion
In summary, this study identified 50 members of the NbPLCP protease family through genome-wide analysis. The functional roles of the PLCP family in N . benthamiana and their importance in exogenous protein expression were thoroughly investigated, highlighting the potential of PLCP in plant physiology and biotechnological applications. Future research should further explore the regulatory mechanisms of PLCP and how these mechanisms can be leveraged to improve plant stress resistance and the efficiency of exogenous protein expression, thereby advancing the application of plant-based bioreactors.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.
Author contributions
YLY: Data curation, Formal Analysis, Writing – original draft, Writing – review & editing. ZD: Data curation, Writing – original draft. ZZ: Visualization, Writing – review & editing. YY: Funding acquisition, Writing – review & editing. XP: Data curation, Methodology, Writing – review & editing. RW: Investigation, Writing – review & editing. TL: Validation, Writing – review & editing. XG: Investigation, Writing – review & editing. LL: Data curation, Writing – original draft. YG: Data curation, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by funds from the National Natural Science Foundation of China (No.32370374 and 32270332) and China National Tobacco Corporation Guizhou Branch Science and Technology Program Project (201911).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The authors declare that this study received funding from China National Tobacco Corporation Guizhou Branch Science and Technology Program Project. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2025.1565487/full#supplementary-material
References
Bailey, T. L., Johnson, J., Grant, C. E., and Noble, W. S. (2015). ). The meme suite. Nucleic Acids Res. 43, W39–W49. doi: 10.1093/nar/gkv416
Bally, J., Jung, H., Mortimer, C., Naim, F., Philips, J. G., Hellens, R., et al. (2018). The rise and rise of Nicotiana Benthamiana: a plant for all reasons. Annu. Rev. Phytopathol. 56, 405–426. doi: 10.1146/annurev-phyto-080417-050141
Cen, Y., Singh, W., Arkin, M., Moody, T. S., Huang, M., Zhou, J., et al. (2019). Artificial cysteine-lipases with high activity and altered catalytic mechanism created by laboratory evolution. Nat. Commun. 10, 3198. doi: 10.1038/s41467-019-11155-3
Chen, C., Chen, H., Zhang, Y., Thomas, H. R., Frank, M. H., He, Y., et al. (2020). Tbtools: an integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 13, 1194–1202. doi: 10.1016/j.molp.2020.06.009
Chen, R., Wang, B., Huang, S., Chen, X., Tan, J., Zhang, H., et al. (2024). Genome-wide identification and male sterility-related expression analysis of papain - like cysteine protease gene family in capsicum annuum. Horticulturae 10, 892. doi: 10.3390/horticulturae10080892
Deng, J., Zhu, F., Liu, J., Zhao, Y., Wen, J., Wang, T., et al. (2019). Transcription factor bhlh2 represses cysteine protease77 to negatively regulate nodule senescence. Plant Physiol. 181, 1683–1703. doi: 10.1104/pp.19.00574
Douglas, A. E. (2018). Strategies for enhanced crop resistance to insect pests. Annu. Rev. Plant Biol. 69, 637–660. doi: 10.1146/annurev-arplant-042817-040248
Finn, R. D., Bateman, A., Clements, J., Coggill, P., Eberhardt, R. Y., Eddy, S. R., et al. (2014). Pfam: the protein families database. Nucleic Acids Res. 42, D222–D230. doi: 10.1093/nar/gkt1223
Gasteiger, E., Gattiker, A., Hoogland, C., Ivanyi, I., Appel, R. D., and Bairoch, A. (2003). Expasy: the proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 31, 3784–3788. doi: 10.1093/nar/gkg563
Grosse-Holz, F., Madeira, L., Zahid, M. A., Songer, M., Kourelis, J., Fesenko, M., et al. (2018). Three unrelated protease inhibitors enhance accumulation of pharmaceutical recombinant proteins in Nicotiana Benthamiana. Plant Biotechnol. J. 16, 1797–1810. doi: 10.1111/pbi.12916
Guo, Y. and Gan, S. S. (2014). Translational researches on leaf senescence for enhancing plant productivity and quality. J. Exp. Bot. 65, 3901–3913. doi: 10.1093/jxb/eru248
Jutras, P. V., Dodds, I., and van der Hoorn, R. A. (2020). Proteases of Nicotiana Benthamiana: an emerging battle for molecular farming. Curr. Opin. Biotechnol. 61, 60–65. doi: 10.1016/j.copbio.2019.10.006
Jutras, P. V., Grosse-Holz, F., Kaschani, F., Kaiser, M., Michaud, D., and van der Hoorn, R. (2019). Activity-based proteomics reveals nine target proteases for the recombinant protein-stabilizing inhibitor slcys8 in Nicotiana Benthamiana. Plant Biotechnol. J. 17, 1670–1678. doi: 10.1111/pbi.13092
Kang, J., Gong, P., Ge, M., Sadeghnezhad, E., Liu, Z., Zhang, M., et al. (2021). The plcp gene family of grapevine (vitis vinifera l.): Characterization and differential expression in response to plasmopara viticola. BMC Plant Biol. 21, 499. doi: 10.1186/s12870-021-03279-w
Konno, K., Hirayama, C., Nakamura, M., Tateishi, K., Tamura, Y., Hattori, M., et al. (2004). Papain protects papaya trees from herbivorous insects: role of cysteine proteases in latex. Plant J. 37, 370–378. doi: 10.1046/j.1365-313x.2003.01968.x
Kordis, D. and Turk, V. (2009). Phylogenomic analysis of the cystatin superfamily in eukaryotes and prokaryotes. BMC Evol. Biol. 9, 266. doi: 10.1186/1471-2148-9-266
Kumar, S., Stecher, G., Li, M., Knyaz, C., and Tamura, K. (2018). Mega x: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549. doi: 10.1093/molbev/msy096
Lescot, M., Dhais, P., Thijs, G., Marchal, K., Moreau, Y., Van de Peer, Y., et al. (2002). Plantcare, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 30, 325–327. doi: 10.1093/nar/30.1.325
Li, C., Li, S., Feng, L., Cheng, J., Xie, J., Lin, Y., et al. (2025). Arabidopsis otu2 deubiquitinates cysteine protease rd21a to enhance clubroot resistance. Plant J. 122, e70148. doi: 10.1111/tpj.70148
Lin, E., Burns, D. J., and Gardner, R. C. (1993). Fruit developmental regulation of the kiwifruit actinidin promoter is conserved in transgenic petunia plants. Plant Mol. Biol. 23, 489–499. doi: 10.1007/BF00019297
Liu, J., Sharma, A., Niewiara, M. J., Singh, R., Ming, R., and Yu, Q. (2018). Papain-like cysteine proteases in carica papaya: lineage-specific gene duplication and expansion. BMC Genomics 19, 26. doi: 10.1186/s12864-017-4394-y
Liu, H., Hu, M., Wang, Q., Cheng, L., and Zhang, Z. (2018). Role of papain-like cysteine proteases in plant development. Front. Plant Sci. 9, 1717. doi: 10.3389/fpls.2018.01717
Lohman, K. N., GAN, S. S., John, M. C., and Amasino, R. M. (1994). Molecular analysis of natural leaf senescence in arabidopsis thaliana. Physiol. Plant 92, 322–328. doi: 10.1111/j.1399-3054.1994.tb05343.x
Long, L., Meng, X., Yu, X., Wang, S., Wang, J., and Yang, M. (2025). Genome-wide identification of the papaya-like cysteine protease family in poplar and determination of the functional role of perd19a in conferring salt tolerance. Int. J. Biol. Macromol 294, 139361. doi: 10.1016/j.ijbiomac.2024.139361
Martinez, M., Cambra, I., Carrillo, L., Diaz-Mendoza, M., and Diaz, I. (2009). Characterization of the entire cystatin gene family in barley and their target cathepsin l-like cysteine-proteases, partners in the hordein mobilization during seed germination. Plant Physiol. 151, 1531–1545. doi: 10.1104/pp.109.146019
Misas-Villamil, J. C., van der Hoorn, R. A., and Doehlemann, G. (2016). Papain-like cysteine proteases as hubs in plant immunity. New Phytol. 212, 902–907. doi: 10.1111/nph.14117
Mueller, A. N., Ziemann, S., Treitschke, S., Assmann, D., and Doehlemann, G. (2013). Compatibility in the ustilago maydis-maize interaction requires inhibition of host cysteine proteases by the fungal effector pit2. PloS Pathog. 9, e1003177. doi: 10.1371/journal.ppat.1003177
Nino, M. C., Kang, K. K., and Cho, Y. G. (2020). Genome-wide transcriptional response of papain-like cysteine protease-mediated resistance against xanthomonas oryzae pv. Oryzae in rice. Plant Cell Rep. 39, 457–472. doi: 10.1007/s00299-019-02502-1
Pierre, O., Hopkins, J., Combier, M., Baldacci, F., Engler, G., Brouquisse, R., et al. (2014). Involvement of papain and legumain proteinase in the senescence process of medicago truncatula nodules. New Phytol. 202, 849–863. doi: 10.1111/nph.12717
Rao, X., Huang, X., Zhou, Z., and Lin, X. (2013). An improvement of the 2^(-delta delta ct) method for quantitative real-time polymerase chain reaction data analysis. Biostat Bioinforma Biomath 3, 71–85.
Rawlings, N. D., Barrett, A. J., and Finn, R. (2016). Twenty years of the merops database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 44, D343–D350. doi: 10.1093/nar/gkv1118
Richau, K. H., Kaschani, F., Verdoes, M., Pansuriya, T. C., Niessen, S., Stuber, K., et al. (2012). Subclassification and biochemical analysis of plant papain-like cysteine proteases displays subfamily-specific characteristics. Plant Physiol. 158, 1583–1599. doi: 10.1104/pp.112.194001
Sainsbury, F., Varennes-Jutras, P., Goulet, M., D’Aoust, M., and Michaud, D. (2013). Tomato cystatin sl cys8 as a stabilizing fusion partner for human serpin expression in plants. Plant Biotechnol. J. 11, 1058–1068. doi: 10.1111/pbi.12098
Schultz, J., Milpetz, F., Bork, P., and Ponting, C. P. (1998). Smart, a simple modular architecture research tool: identification of signaling domains. Proc. Natl. Acad. Sci. U.S.A. 95, 5857–5864. doi: 10.1073/pnas.95.11.5857
van Wyk, S. G., Du Plessis, M., Cullis, C. A., Kunert, K. J., and Vorster, B. J. (2014). Cysteine protease and cystatin expression and activity during soybean nodule development and senescence. BMC Plant Biol. 14, 294. doi: 10.1186/s12870-014-0294-3
Wang, Y., Tang, H., DeBarry, J. D., Tan, X., Li, J., Wang, X., et al. (2012). Mcscanx: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 40, e49. doi: 10.1093/nar/gkr1293
Yuan, S., Ke, D., Li, R., Li, X., Wang, L., Chen, H., et al. (2020). Genome-wide survey of soybean papain-like cysteine proteases and their expression analysis in root nodule symbiosis. BMC Plant Biol. 20, 517. doi: 10.1186/s12870-020-02725-5
Zhang, Z. (2022). Kaks_calculator 3.0: calculating selective pressure on coding and non-coding sequences. Genomics Proteomics Bioinf. 20, 536–540. doi: 10.1016/j.gpb.2021.12.002
Zhang, C., Chen, Y., Wang, L., Liu, L., Zhong, X., Chu, P., et al. (2023). Genome-wide identification of papain-like cysteine protease family genes in cultivated peanut (arachis hypogaea l.) And functional characterization of ahrd21b in response to chilling stress. Environ. Exp. Bot. 209, 105272. doi: 10.1016/j.envexpbot.2023.105272
Zhang, B., Tremousaygue, D., Denance, N., van Esse, H. P., Horger, A. C., Dabos, P., et al. (2014). Pirin2 stabilizes cysteine protease xcp2 and increases susceptibility to the vascular pathogen ralstonia solanacearum in arabidopsis. Plant J. 79, 1009–1019. doi: 10.1111/tpj.12602
Keywords: PLCP, Nicotiana benthamiana, biological stress, family analysis, gene expression patterns
Citation: Yang Y, Deng Z, Zhang Z, Yang Y, Pan X, Wu R, Liu T, Gao X, Li L and Guo Y (2025) Papain-like cysteine proteases in Nicotiana benthamiana: gene family members and their potential implications in recombinant protein expression. Front. Plant Sci. 16:1565487. doi: 10.3389/fpls.2025.1565487
Received: 23 January 2025; Accepted: 30 May 2025;
Published: 19 June 2025.
Edited by:
Giuseppe Dionisio, Aarhus University, DenmarkReviewed by:
Mercedes Diaz-Mendoza, Complutense University of Madrid, SpainEliane Ferreira Noronha, University of Brasilia, Brazil
Copyright © 2025 Yang, Deng, Zhang, Yang, Pan, Wu, Liu, Gao, Li and Guo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yongfeng Guo, Z3VveW9uZ2ZlbmdAY2Fhcy5jbg==; Lingyan Li, bGlsaW5neWFuQHFhdS5lZHUuY24=
†These authors have contributed equally to this work and share first authorship