Abstract
The complex composition of venom, a proteinaceous secretion used by diverse animal groups for predation or defense, is typically viewed as being driven by gene duplication in conjunction with positive selection, leading to large families of diversified toxins with selective venom gland expression. Yet, the production of alternative transcripts at venom genes is often overlooked as another potentially important process that could contribute proteins to venom, and requires comprehensive datasets integrating genome and transcriptome sequences together with proteomic characterization of venom to be fully documented. In the common house spider, Parasteatoda tepidariorum, we used RNA sequencing of four tissue types in conjunction with the sequenced genome to provide a comprehensive transcriptome annotation. We also used mass spectrometry to identify a minimum of 99 distinct proteins in P. tepidariorum venom, including at least 33 latrotoxins, pore-forming neurotoxins shared with the confamilial black widow. We found that venom proteins are much more likely to come from multiple transcript genes, whose transcripts produced distinct protein sequences. The presence of multiple distinct proteins in venom from transcripts at individual genes was confirmed for eight loci by mass spectrometry, and is possible at 21 others. Alternative transcripts from the same gene, whether encoding or not encoding a protein found in venom, showed a range of expression patterns, but were not necessarily restricted to the venom gland. However, approximately half of venom protein encoding transcripts were found among the 1,318 transcripts with strongly venom gland biased expression. Our findings revealed an important role for alternative transcription in generating venom protein complexity and expanded the traditional model of venom evolution.
Introduction
Venoms exhibit a high degree of biochemical complexity, with the venom of an individual organism being composed of up to hundreds of distinct proteins or peptides (Escoubas et al., ; King and Hardy, ). Such complexity may enable the rapid immobilization of a phylogenetically divergent array of prey and simultaneous disabling of many cellular functions (Olivera, ; Ushkaryov et al., ; Casewell et al., ). This complexity also makes venoms important resources for therapeutic drug discovery (Lewis and Garcia, ; Escoubas and King, ; Netirojjanakul and Miranda, ), targeted insecticide development (Schwartz et al., ; Windley et al., ; Smith et al., ), and neurophysiological research (Ushkaryov et al., , ; Ashton, ; Südhof, ; Deak et al., ; Silva et al., ).
The typical explanation for the generation of the diverse venom protein complement proposes a dominant role for gene duplication. After an initial gene duplication event, a shift to venom gland limited expression of the duplicate gene occurs. This is followed by further gene duplication events that generate diversity through the production of venom gland restricted toxin families with functionally distinct paralogs molded by the action of positive selection (Kordis and Gubensek, ; Fry et al., ; Wong and Belov, ; Schwager et al., ). Yet, other mechanisms such as the production of alternative transcripts expressed in the venom gland and producing venom toxins via alternative transcriptional start or polyadenylation sites, or through alternative splicing, could also generate protein diversity (Nilsen and Graveley, ; Pal et al., ; de Klerk and ‘t Hoen, ). Alternative transcription could thus act to enhance venom diversity, yet has been reported in only a small number of individual cases. For example, alternative splicing has been suggested to account for novel venom protein variants in snakes, lizards, and wasps (Siigur et al., ; Fry et al., ; Viala et al., ; Yan et al., ). However, the putative alternatively spliced variants in these studies were derived from de novo assemblies of short-read data or EST sequencing, and no genomic information was available to confirm whether these variants came from the same locus. Furthermore, while sequenced genomes, together with transcriptomic or proteomic datasets, are available for a limited set of venomous organisms (e.g., de Plater et al., ; Torres et al., ; Warren et al., ; Whittington et al., , ; Kita et al., ; Sanggaard et al., ), no comprehensive exploration of the role of alternative transcription at venom genes or in venom composition has been undertaken in genome sequenced species.
In addition, the presence of alternative transcripts at venom genes may require a modified understanding of how an evolutionary shift in expression of a toxin transcript to the venom gland might be accomplished. If a gene that is duplicated has preexisting multiple transcripts, potentially only one of several alternative transcripts might shift expression to the venom gland, or such a shift to venom gland restricted expression could occur with no duplication from a gene that has multiple transcripts. In humans and other organisms, the use of alternative start or polyadenylation sites, or alternative splicing, play an important role in generating tissue-specific transcripts from existing genes (Farajzadeh et al., ; Hestand et al., ; Sanfilippo et al., ; Reyes and Huber, ). This process could provide a mechanism for altering venom composition through the generation of novel toxins expressed in the venom gland that circumvents the need for preceding gene duplication events.
Spiders are now amongst the venomous species with sequenced genomes (Sanggaard et al., ; Babb et al., ; Schwager et al., ), including the common house spider Parasteatoda tepidariorum, an emerging model for the study of arthropod development and evolution (Hilbrant et al., ). Although P. tepidariorum is a member of the spider Family Theridiidae along with the notorious black widows of the genus Latrodectus, its bites are generally not harmful to humans (Isbister and Gray, ; Isbister and White, ). Thus, this species is of importance for the study of venom evolution, as a contrast to black widow venom with its extreme toxicity to vertebrates. Additionally, a recent investigation of an earlier version of the P. tepidariorum genome assembly generated by the i5k (5,000 arthropod genomes) consortium focused on the diversity and evolution of two gene families whose proteins are abundant in black widow venom: latrotoxins and latrodectins (Gendreau et al., ), including a preliminary analysis of expression in a single venom gland RNA-Seq library. However, a full evaluation of the contribution of alternative transcriptional events at venom genes to venom diversity requires high-quality genomes together with replicated transcriptomes, as well as data on the protein composition of venom.
Here we integrated newly obtained data from multi-tissue RNA-Seq and tandem mass spectrometry (MS/MS) with an improved assembly of the recently sequenced P. tepidariorum genome (Schwager et al., ) to reconstruct transcripts at genomic loci, and characterize the complexity of this species venom by identifying the protein products of transcripts with MS/MS data from the secreted venom itself. In order to investigate the importance of alternative transcription in generating venom diversity, we tested whether alternative transcripts are characteristic of genes encoding venom proteins, and also whether alternative transcripts from the same locus may generate multiple venom toxins with distinct protein sequences or architectures. We also explored the expression patterns across tissues of alternative transcripts found at venom genes. In particular, we tested whether the expression of venom protein encoding transcripts is restricted to, or highest in, the venom gland, as predicted in a traditional model of venom evolution (Fry et al., ), and as expected given that their products are presumed to be secreted toxins. We also tested whether venom genes have all transcripts, or only a subset, that are venom gland restricted, or whether some transcripts at venom genes are primarily expressed outside of the venom gland, suggesting that they may have alternative functions.
Materials and Methods
Laboratory Protocols for RNA-Sequencing
Adult female P. tepidariorum were maintained in the laboratory on a diet of crickets, and were fed 3 days prior to being placed under 5 min of CO2 anesthesia for extraction of venom glands, silk glands, and ovaries and isolation of the cephalothorax after venom gland removal. Extraction of RNA was performed with a modified Trizol protocol, followed by column purification using an RNeasy kit. As it was necessary to pool venom glands from 11 to 12 individuals to obtain sufficient material for RNA extraction, and to average out inter-individual variability, we also combined silk glands, ovaries and cephalothoraxes from 3 individuals in each replicate. Two replicates per tissue type, for a total of eight biologically replicated RNA-Seq libraries were generated, with each replicate from a different pool of individuals.
Transcript Reconstruction and Expression Analysis
Quality control and library preparation for Illumina sequencing were performed at the Deep Sequencing Core Laboratory of the University of Massachusetts Medical Center. All cDNA libraries were sequenced using a 100 bp paired-end protocol on an Illumina HiSeq 4000. Low-quality sequence was trimmed from reads at a Phred score threshold of 20 and Illumina adapters were removed with TrimGalore 0.3.7 (https://github.com/FelixKrueger/TrimGalore) incorporating FastQC 0.11.3 (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) to verify read quality: median quality scores were at least 30 across all bases in all reads. Trimmed reads were aligned to the P. tepidariorum genome assembly (NCBI Accession: AOMJ00000000.2; Schwager et al., ) using Hisat2 2.0.5 (Kim et al., ) with settings for downstream transcriptome assembly, a maximum sequence mismatch penalty of 4 and a minimum mismatch penalty of 1. To generate a more comprehensive reference annotation incorporating expression information from isolated tissue libraries, alignments were performed for each of the eight libraries from this study, two libraries (venom gland and silk gland) from a previous study (Gendreau et al., ) and a single library from ovary tissue produced by the i5k project (i5KConsortium, ). Stringtie 1.2.4 (Pertea et al., ) was then used to assemble transcripts using an existing annotation for the Schwager et al. assembly as a reference in each run. Individual assemblies from each library were merged with the reference genome annotation for P. tepidariorum to produce a final annotation, which was used for estimation of transcript abundance. All transcripts in the final merged annotation were identified by comparison to the nr database using the blastx command in Diamond 0.8.23 (Buchfink et al., ) using the best local alignment at an e-value cutoff of 1e-5. The final merged annotation was compared to the reference annotation to identify novel genes expressed from previously intergenic regions (class code “u”) using Cuffcompare in the Cufflinks suite (Roberts et al., ).
Analysis of differential expression was performed using the R package EBSeq (Leng et al., ). Read counts for use in EBSeq were generated using the prepDE script provided with Stringtie and five counts were added to each value to prevent underflow errors during calculations in R when counts were zero in both replicates for a tissue. We performed pairwise comparisons between venom gland and the three other tissues and identified transcripts with expression significantly upregulated in the venom gland at a false discovery rate of 5% in each comparison. The intersection of the three sets constituted the final set of venom gland upregulated transcripts. To confirm novel splice junctions and to calculate junction support as reads mapped to junctions present in the merged final annotation, we used STAR 2.5.0a (Dobin et al., ) using default parameters.
Protein Identification and Sequence Analysis
Milked venom was obtained from SpiderPharm (Yarnell, AZ) where it was produced by pooling venom extracted by electrostimulation from ~300 anesthetized adult female P. tepidariorum, in order to generate sufficient material and to capture the broadest sample of venom components. Lyophilized venom was separated on polyacrylamide gels and divided into 8 fractions for subsequent analysis, including a small peptide fraction. Each fraction was trypsin digested, and separated on a Thermo/Proxeon nano-HPLC apparatus. Eluted proteins were subjected to mass spectrometry using a Linear Trap Quadropole (LTQ) Velos Orbitrap tandem mass spectrometer at the University of Arizona Cancer Center Proteomics Shared Resource. Three technical replicates of this analysis were performed. A database for searching with mass spectra was produced from predicted proteins from all transcripts in the final merged annotation of the P. tepidariorum genome assembly. While polymorphism is present in the RNA-Seq reads, database transcripts were each represented by a single consensus sequence derived from the genome assembly that summarizes mappings from reads across all libraries. Proteins were predicted first by translating all open reading frames >90 nucleotides for each transcript and then using an in-house Perl script to choose the longest protein in the frame of the best blastx hit, or the longest protein in the absence of a BLAST hit. Sets of transcripts with 100% identical predicted proteins were identified using CD-hit (Li and Godzik, ) after the trimming of protein predictions to the first methionine residue. Protein domain predictions were performed with InterProScan (Quevillon et al., ). Predictions of protein toxicity were performed with Clantox (Naamati et al., ), and of inhibitory cystine knot structural folds with Knoter1d (Gracy et al., ).
Spectra were matched to proteins in the reference database using Sequest version 1.3.0.339 and the results were viewed in Scaffold 4.4.7. To consider a protein as present in the venom, peptide probabilities were set at 95% and protein probabilities at 99%, with a minimum of 2 peptide matches per protein required. The decoy false discovery rate for these settings was 0%. Proteins with shared peptides that could not be discriminated were placed into groups, while proteins that were not supported by independent evidence were deemed not present in venom. The identified protein list was purged of common contaminants (keratin, trypsin) and of a likely arthropod contaminant (hemocyanin).
Results
RNA-Seq Based Transcript Prediction Helps Reveals Venom Composition
Novel genes and transcripts were predicted from the P. tepidariorum genome assembly using eight new RNA-Seq libraries, as well as three previously published libraries (Gendreau et al., ), and merged with an existing annotation for this genome (Schwager et al., ). This resulted in a final tally of 58158 genes and 90093 transcripts (Supplementary File 1), of which 23989 novel genes and 25214 novel transcripts were defined as expressed regions of the genome that did not overlap with any existing annotated gene. However, only 4,718 (19.7%) of these putative novel genes had a BLAST hit to the nr database at an e-value of 1e-5 or better, a far lower proportion than that for the total number of genes with a BLAST hit in the entire genome (73.6%), and the function of the remaining expressed regions remains unknown.
The SDS-PAGE gels of P. tepidariorium venom showed multiple protein bands, including high molecular weight bands in the size range of latrotoxins (110–130 kDa), as well as a number of smaller bands, with the smallest in the 10–15 kDa range (Figure S1, Figure S2). After discarding contaminants, we identified via tandem mass spectrometry a minimum of 99 proteins across all gel slices in the venom of the common house spider (Table S1). Fifty-three proteins were unique identifications: a single protein that could be discriminated from other proteins in the database by the peptides identified in the experiment. However, 46 identifications were of a group of proteins that were not distinguishable by the peptide data. Further, 26 of these 46 groups contained 2–5 proteins that were distinct in sequence when compared from the putative start codon, yielding a potential maximum of 139 distinct proteins in the venom. The remaining 20 groups contained only identical proteins, but these proteins had been included in the database as they were produced by distinct transcripts.
For enumeration of types of venom components, we provide a conservative minimum estimate assuming that only a single predicted protein from a group contributed an individual identification (99 total proteins), and a maximum estimate that includes all proteins from each group (139 total) with distinct sequence, but that could not be discriminated. Proteins were assigned to toxin or other categories of interest using the BLASTx hit with lowest e-value for the transcript from which they were predicted. Latrotoxins were the most numerous type of known toxin in P. tepidariorum venom, with 33–38 different latrotoxin proteins present. However, latrodectins (small accessory proteins to latrotoxins), inhibitory cysteine knot toxins (ICKs) and cysteine-rich secretory proteins (CRISP) were not diverse in venom, with only 1–2 representatives each (Table 1; Figure 1; Table S1). There were 3–7 proteases, and 3–7 proteins corresponding to other enzymes (lipases, amylases, and hyaluronidases) in the venom. We identified 6–9 proteins with homology to leucine-rich repeat proteins (Table 1; Figure 1; Table S1), which were also identified in confamilial black widow venom (Haney et al., ). There were 24–34 distinct proteins in venom with a best BLAST hit to uncharacterized proteins (Table 1), including 8–9 proteins derived from genes in a novel family identified by Gendreau et al. () with transcripts having very high expression in P. tepidariorum venom glands, and 2–3 proteins lacking a BLAST hit. Of these 26–37 distinct uncharacterized or unknown proteins, 14–17 were < 200 amino acids in length, and rich in cysteines (6–14 Cys residues). However, zero were predicted to have an inhibitory cystine knot structural motif, although 1 was predicted as a probable toxin and 3 were labeled as possible toxins by the Clantox server (Table S1).
Table 1
| Class | # In venom | # VGTup | # In venom and VGTup |
|---|---|---|---|
| Latrotoxin | 33–38 | 51 | 21 |
| Latrodectin | 1–1 | 2 | 1 |
| CRISP | 1–2 | 3 | 1 |
| ICK | 1–1 | 3 | 1 |
| Protease | 3–7 | 25 | 4 |
| Other enzyme* | 3–7 | 17 | 4 |
| LRR | 6–9 | 11 | 1 |
| Uncharacterized | 24–34 | 144 | 19 |
| Novel family | 8–9 | 14 | 9 |
Numbers of proteins corresponding to different known toxin or other category of interest in venom, or from a transcript upregulated in the venom gland, or both, as determined by BLAST homology.
lipase, amylase, hyaluronidase. Numbers separated by dashes indicate minimum and maximum estimates of the number of proteins of a given type, given ambiguity in the MS identifications for similar proteins. To calculate the number that are both in venom and are VGTup, we used the maximum estimate of distinct proteins in venom. CRISP, cysteine-rich secretory protein; ICK, inhibitory cysteine knot toxin; LRR, leucine-rich repeat protein; Novel Family, putative toxin family identified in Gendreau et al. ().
Figure 1
Venom Genes Typically Have Multiple Transcripts and Encode Distinct Proteins
In sum, 86 genes encoded at least one protein identified in venom, 28 of which produced only a single transcript (of which 15 were latrotoxins), while 58 were multiple transcript genes. The proportion of multiple transcript genes (58/86: 67.4%) contributing to venom was significantly higher by chi-square goodness-of-fit test (χ2 = 115.82, df = 1, p < 0.0001) than the proportion of genes across the genome that encoded multiple transcripts (11922/58158: 20.5%). Identified events producing alternative transcripts at venom multiple transcript genes included alternative 5′ splice sites (7, 6.6%), alternative 3′ splice sites (3, 2.8%), exon skipping (13, 12.3%), mutually exclusive exons (26, 24.5%), alternative first exons (27, 25.5%), and alternative last exons (30, 28.3%). However, no venom gene alternative transcripts were produced through intron retention. Accounting for the presence of grouped and indistinguishable proteins, including identical proteins produced by distinct transcripts, there were 169 transcripts from these genes encoding the maximum 139 distinct venom proteins. Most of these transcripts (141, 83.4%) were encoded by the 58 multiple transcript genes (Table S2), and a slight majority (72) of these 141 transcripts were novel to this study. Together with the 141 transcripts encoding a venom protein, the 58 multiple transcript venom genes had an additional 135 predicted transcripts, for a total of 276.
These 276 transcripts produced 210 different protein sequences, and individual genes encoded 1–22 distinct variants. Forty-four of the 58 multiple transcript genes had transcripts that encoded more than one distinct protein, accounting for 196 of the 210 (Table S2: column 2). The greatest number of distinct proteins encoded at a locus occurred at genes with BLAST homology to chitinases (MSTRG.35296, 27 transcripts, 22 distinct proteins), endothelin-converting enzymes (MSTRG.35640, 22 transcripts, 20 distinct proteins), and latrotoxins (MSTRG.45528, 23 transcripts, 17 distinct proteins).
Among the 44 multiple transcript genes producing multiple protein variants via alternative transcripts, peptide evidence from the venom MS experiment was sufficient to distinguish a single protein as present in venom for 15 (Table S2, column 3). For example, gene MSTRG.32727, an inhibitory cystine knot toxin, produced four transcripts, two of which were novel to this study and which both possessed a 5′ untranslated region (UTR) bearing a highly supported intron with canonical splice sites not present in the previous genome annotation (Figure 2; Supplementary File 1; Table S3). These four transcripts produced two divergent predicted proteins (each encoded by two distinct transcripts) that differed in predicted disulfide binding pattern, an important determinant of function in ICK toxins. For this gene, however, only one of these proteins was distinguished in venom (Figure 2) by unique peptide matches.
Figure 2

A venom protein-encoding multiple transcript gene produces two distinct proteins, but only one is identified in venom. Locus MSTRG.32727 has homology to inhibitory cystine knot toxins, and has four transcripts which produce two different predicted proteins. For each of the two distinct proteins, two different transcripts encode the same protein sequence, which vary by the presence or absence of an intron bearing 5' UTR. The proteins vary in length and are divergent in sequence, with identical residues in red. Each has five predicted disulfide bonds, but the predicted pattern of connectivity is different. Transcripts labeled “V” have protein products identified in venom by MS. A “*” indicates that these proteins cannot be discriminated by the MS data. Numbers over introns indicate the number of spliced reads supporting novel junctions across all libraries. Also shown is venom gland expression in TPM rounded to the nearest whole number, and whether the transcript is upregulated in the venom gland (dark red text) relative to other tissues.
Venom Genes Can Contribute More Than One Protein to Venom
In contrast, eight of the 29 other venom multiple transcript genes producing multiple proteins contributed at least two distinguishable proteins each to the venom (Table 2, Table S2: column 4) as assessed by the MS data, accounting for 23 total distinct venom proteins (16.5% of all distinct proteins). Each of the 8 genes were from regions of the genome with more than one paralogous gene in the existing annotation of the dovetail genome assembly (Schwager et al.,
Table 2
| Gene | BLAST ID | transcripts/proteins/ proteins in venom | Genomic coordinates |
|---|---|---|---|
| MSTRG.15150 | Latrotoxin | 4/4/2 | Scf5bK3_2009: 246682-283244 |
| MSTRG.21390 | Leucine-rich repeat | 6/5/2 | Scf5bK3_258: 4313870-4351624 |
| MSTRG.2390 | None | 7/5/2 | Scf5bk3_112: 476966-508466 |
| MSTRG.25517 | Uncharacterized | 5/3/2 | Scf5bK3_306: 5509048-5555723 |
| MSTRG.32970 | Latrotoxin | 8/6/2 | Scf5bK3_440: 829934-879481 |
| MSTRG.35296 | Chitinase | 27/22/2 | Scf5bK3_479: 825520-993306 |
| MSTRG.35640 | Endothelin-converting enzyme | 22/20/4 | Scf5bK3_479: 10697475-10953136 |
| MSTRG.45528 | Latrotoxin | 23/17/7 | Scf5bK3_756: 633096- 755920 |
Basic information for eight loci that contribute more than one protein to venom as assessed by tandem mass spectrometry.
Column 3 lists the total number of transcripts predicted at the locus, the number of distinct proteins predicted from these transcripts, and the total number identified in venom.
Figure 3

A venom protein-encoding multiple transcript gene produces two venom proteins. Locus MSTRG.15150 has homology to latrotoxins, and has four transcripts (two novel to this study), which produce four different predicted proteins, which vary in UTR length and exon-intron structure. Coding regions are shaded pink, and non-coding gray. The proteins produced vary slightly in length but also in primary sequence, as enumerated in the lower table on the right, in which numbers correspond to transcript codes in the upper table. Transcripts labeled “V” have protein products identified in venom by MS. A “*” indicates that these proteins cannot be discriminated by the MS data. Numbers over introns indicate the number of spliced reads supporting novel junctions across all libraries. Also shown is venom gland expression in TPM rounded to the nearest whole number, and whether the transcript is upregulated in the venom gland (dark red text) relative to other tissues.
Figure 4

A multiple transcript gene encodes two structurally distinct proteins found in venom. (A) Locus MSTRG.21390, with homology to leucine-rich repeat proteins, has 6 distinct transcripts, four which are novel to this study, encoding 5 distinct proteins. Novel upstream exons containing coding sequence together with a short untranslated region are shaded in gray. Transcripts labeled “V” have protein products identified in venom by MS. A “*” indicates that these proteins cannot be discriminated by the MS data. Numbers over introns indicate the number of spliced reads supporting novel junctions across all libraries. Also shown is venom gland expression in TPM rounded to the nearest whole number, and whether the transcript is upregulated in the venom gland (dark red text) relative to other tissues. (B) Three different protein domain architectures were predicted from protein primary sequences at this locus, and the first two shown were identified in venom. The leucine-rich repeat locations and size are identical for MSTRG.21390.3 and MSTRG.21390.2, but LRR L-domain start and end positions differ by 1 basepair, so these sequences are considered to have the same architecture. (C) Sequence of the novel protein from transcript MSTRG.21390.3 indicating peptides mapped to the sequence from MS data (orange).
For example, gene MSTRG.15150, a latrotoxin, had four transcripts in our updated annotation, two of which were present in the previous annotation, and which were annotated as separate genes lacking UTRs (Figure 3). The two novel transcripts from this study show novel TSS and UTRs with three previously undetected introns. These introns possessed canonical splice sites and were highly supported by spliced reads (Figure 3; Table S3). Each of the four transcripts produced a unique protein. One transcript novel to this study, MSTRG.15150.1, encompassed the entire region, and combined segments of two distinct coding regions to produce a novel protein sequence. The other, MSTRG.15150.2, produced a longer predicted coding sequence distinct from transcript aug3.g26325.t1, which it subsumed, via an upstream start codon. At least two, and possibly three, proteins predicted from this locus were identified in venom (Figure 3). The protein from transcript aug3.g26326.t1 was identified, and while the peptide data indicated that the protein produced by transcript MSTRG.15150.1 was not in the venom, the proteins produced by MSTRG.15150.2 and aug3.g26325.t1 could not be discriminated, and hence one or both may be present.
Distinct Venom Proteins From Novel Transcripts Are Supported by MS Data
In two cases, transcripts novel to this study received support from the MS data. First, a leucine-rich repeat protein gene, MSTRG.21390, contributed at least two distinct proteins to venom. This locus had six transcripts (four novel to this study), which in total produced five distinct proteins (three novel to this study). Again, these novel transcripts included UTRs not present in the original annotation, as well as novel exons and introns with canonical splice sites, which were generally highly supported by spliced reads (Figure 4; Table S3; Supplementary File 1). Three of these novel exons added coding sequence at the 5′ end of the transcript (Figure 4). At least two, and possibly three of the proteins produced, with different arrangements of leucine-rich repeats, were found in venom (Figure 4B), including the novel protein predicted from transcript MSTRG.21390.3, which spanned genes from the original annotation, combining exons into a novel combination, and was uniquely identified by MS data. Novel transcript MSTRG.21390.1 produces a protein distinct from one encoded by two different transcripts (MSTRG.21390.5 and aug3.g6329.t1) due to 7 amino acids predicted from a novel upstream exon. However, the two proteins are otherwise identical and cannot be discriminated by peptides in the current MS data set, and hence one or both may be present in venom in addition to the protein from MSTRG.21390.3.
A second novel predicted protein sequence, from the chitinase gene MSTRG.35296, was uniquely identified in venom. This protein was one of two distinct proteins from this locus that were identified in venom, and was encoded by two novel transcripts (Figure S3) that spanned and shared exons with three genes in the original annotation (Figure S3; Supplementary File 1). Although this locus was transcriptionally complex, with 27 transcripts (Figure S3; Supplementary File 1) and 22 proteins, for the most part novel introns, which contributed to the generation of protein variation, were highly supported by spliced reads and possessed canonical splice sites (Table S3).
Alternative Transcripts May Further Contribute to Venom Protein Diversity
An additional 21 (of 29) genes produced from 2 to 9 different proteins, of which a minimum of 1 and a maximum of 4 (Table S2, column 3) per gene were potentially present in venom. However, although the potential venom proteins were distinct in sequence, they were only matched by shared peptides. In sum, these 21 genes may contribute a minimum of 21 and a maximum of 63 proteins to the venom. Gene MSTRG.6569 illustrates this result, having four transcripts (two novel) that produce four distinct proteins that cannot be differentiated by peptides derived from the MS experiment (Figure S9), in addition to four other novel transcripts that do not produce a venom-identified protein. As in previous examples, novel transcripts were generally highly supported by overall read counts and by spliced reads across novel exon-exon junctions (Table S3). This phenomenon was also observed among the eight genes contributing more than one protein to venom. Five of these genes (MSTRG.25517, MSTRG.15150, MSTRG.21390, MSTRG.2390, MSTRG.32970) also produce additional distinct proteins that may be present in venom, but could not be differentiated by the MS data, as only shared peptides were matched. Thus, up to seven additional distinct venom proteins may be produced by these loci.
Venom Gene Transcripts Show Variable Spatial Expression Patterns
Reads from eight sequenced libraries from venom glands, silk glands, ovaries, and cephalothorax (with venom glands removed) were mapped to the genome (Table S4), and used to identify 1,318 venom gland upregulated transcripts (VGTup), transcripts with a pattern of expression highly biased toward the venom gland (Table S5), which came from 1,095 genes. This is a substantially higher number than the 355 genes with a VGTup found in a previous study (Gendreau et al.,
This pattern of expression was observed at both single and multiple transcript venom protein encoding genes. Of the 28 single transcript genes that encoded a protein identified in venom (Table S6), only 14 (50%) had transcripts that were significantly upregulated in venom gland. These 14 transcripts, including 4 latrotoxins (Table S6), were among the most highly expressed transcripts in venom gland, with 11 in the top 1% of venom gland expression rank, and all 14 in the top 5% (average TPM = 278.3), although several showed some level of expression in other tissues (Table S6). However, the remaining 14 transcripts, although also producing venom proteins, mostly exhibited low expression across tissues (Table S6), including the venom gland (average TPM = 0.6), although 11 were latrotoxins. Furthermore, six of these transcripts had no measurable expression in venom gland (TPM = 0).
At multiple transcript venom genes, less than half of all venom protein-encoding transcripts exhibited venom gland upregulation (66 of 141: 46.8%) and high abundance in the venom gland (Table S6; Figure 5A), with 56 transcripts in the top 1% of venom gland expression and 63 of 66 in the top 5% (average TPM = 4697.6), including 18 latrotoxins (Table S6). Yet, as with single transcript venom genes, 75 other transcripts from venom multiple transcript genes that encoded a venom protein were not venom gland upregulated, and were generally lowly expressed in all tissues, with median TPM < 1 (Figure 5B). Furthermore, 27 had zero expression in venom gland (Table S6), including eight latrotoxin transcripts. However, 29 (37.2%) of these 75 transcripts that do not exhibit venom gland upregulation produced a protein that belongs to a group of venom proteins indiscriminable by our MS data (Table S1, Table S6), in which at least one other transcript is venom gland upregulated. Hence, it is possible in these cases that the venom gland upregulated transcript actually produces the protein found in venom. Also, 12 of these 75 transcripts, while not significantly upregulated, do have their highest average expression in the venom gland (Table S6). Surprisingly, however, nine of these transcripts that were not grouped at protein level with a VGTup actually showed relatively substantial expression in all tissues, and 30 had higher average expression in at least one other tissue than in venom gland.
Figure 5

Average expression (TPM) across tissues for transcripts from multiple transcript genes encoding at least one protein identified in venom. Top panels are for transcripts encoding a venom identified protein while the bottom panels show the range of values for transcripts from the same set of loci whose encoded proteins could not be identified in venom. Panels (A,C) show values for transcripts that were significantly upregulated in venom gland relative to the other three tissues, while panels (B,D) show values for transcripts that were not significantly upregulated in venom gland. Black bars show median values, while the interquartile range is indicated by the boundaries of the colored boxes. Whiskers are 1.5x the interquartile range (IQR) and dots indicate outliers beyond 1.5 × the IQR.
Multiple transcript genes that contribute a protein to venom also had from 0 to 23 transcripts whose encoded protein was excluded from being present in venom by the current MS data (Table S2), which in theory could be transcripts that have an alternative function to that of a toxin. Yet, the pattern of expression of these transcripts is similar to those that do produce a venom protein. Approximately half (68 of 135: 50.4%) of these additional transcripts at venom protein encoding loci are significantly upregulated and abundant in the venom gland (Table S6; Figure 5C), with 48 in the top 1% of venom gland expression and 65 in the top 5%. The 67 remaining transcripts at these genes, not significantly upregulated in venom gland, are generally lowly expressed (< 1 TPM) on average in all tissues (Table S6; Figure 5D), and although 13 had highest average expression in the venom gland, 22 transcripts had 0 TPM in the venom gland. Five of these transcripts showed expression > 5 TPM in all tissues, and 41 had higher average expression in at least one other tissue than in venom gland. Three had zero or very low expression in the venom gland, and expression in at least one other tissue that was at least an order of magnitude greater.
Discussion
Common House Spider Venom Is a Diverse Mixture
Animal venoms are complex, heterogeneous solutions, and often contain a diverse suite of molecules (Casewell et al.,
Alternative Transcripts and Venom Diversity
The presence of products from numerous toxin genes of the same type in P. tepidariorum venom is consistent with the standard notion of gene duplication as a predominant force in the diversification of venom toxins (Kordis and Gubensek,
The involvement of alternative transcripts in the generation of venom protein diversity were most clearly demonstrated by eight genes defined in the new annotation developed for this study that had more than one protein unambiguously identified in venom by mass spectrometry, as these genes together accounted for 23 distinct venom proteins. However, in the pre-existing genome annotation, these regions contained multiple distinct paralogous genes (Schwager et al.,
Furthermore, the contribution from protein sequence variants at venom multiple transcript genes may be more extensive than those identified from these eight genes. For most venom multiple transcript genes, a level of ambiguity remained as to which of several proteins occurred in the venom, as the only peptides identified as present in the venom by MS are shared among related proteins. Hence, while these genes may contribute only a single protein to venom, the possibility remains that some are actually contributing more than one distinct protein and adding to venom complexity, and in general the transcripts producing protein variants are highly supported by RNA-Seq. The MS data is restricted to a single experiment, and the augmentation of the available proteomic data might aid in resolving this ambiguity, and in confirming whether additional proteins from these genes, including those at low levels, are present in common house spider venom.
Many Venom Gland Selective Transcripts Are Not in Venom
An expectation of the traditional model of venom evolution is that a switch to selective venom gland expression of a new toxin gene occurs subsequent to its origination via duplication from a non-venom “body” protein gene that fulfills some typical physiological role (Kordis and Gubensek,
Transcripts at Venom Genes Are Not Always Expressed Selectively in Venom Gland
In contrast with the selective expression in the venom gland as posited in the traditional model of venom evolution (Fry et al.,
However, interpretation of patterns of transcript expression is complicated by the potential influence of environmental conditions on gene expression, and venom gland expression of some transcripts could potentially be enhanced by certain environmental stimuli, including exposure to different prey communities (Daltry et al.,
In summary, through the generation and use of genomic, transcriptomic and proteomic data we find that venom complexity may be achieved through both gene duplication and complex patterns of alternative transcription. Patterns of transcript expression do not comport with simple predictions from a traditional model of venom evolution, and may reflect the interplay of environmental and genetic mechanisms. Though this study focused on a single spider species, we expect these findings will be broadly applicable across venomous taxa in explaining how diverse venom complements are generated.
Statements
Author contributions
RH and JG conceived and designed the study. RH and JG performed laboratory procedures. RH, FF, and TM performed the analysis. RH wrote the first draft of the manuscript. All authors contributed to manuscript revision, and read and approved the submitted version.
Acknowledgments
Evelyn Schwager for assistance in spider dissections, Chuck Kristensen for spider venom extraction, George Tsaparalis and Linda Breci for mass spectrometry analysis, and Jonathan Coddington and Nico Posnien for access to the Dovetail genome assembly and annotation. This work was supported by funding from the National Institutes of Health (GM097714-01, GM097714-02) to JG.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fevo.2019.00085/full#supplementary-material
References
1
AkivaP. (2005). Transcription-mediated gene fusion in the human genome. Genome Res.16, 30–36. 10.1101/gr.4137606
2
AshtonA. C. (2001). alpha -Latrotoxin, acting via two Ca2+-dependent pathways, triggers exocytosis of two pools of synaptic vesicles. J. Biol. Chem.276, 44695–44703. 10.1074/jbc.M108088200
3
BabbP. L.LahensN. F.Correa-GarhwalS. M.NicholsonD. N.KimE. J.HogeneschJ. B.et al. (2017). The Nephila clavipes genome highlights the diversity of spider silk genes and their complex expression. Nat. Genet.49, 895–903. 10.1038/ng.3852
4
BinfordG. (2001). Differences in venom composition between orb-weaving and wandering Hawaiian Tetragnatha (Araneae). Biol. J. Linn. Soc.74, 581–595. 10.1111/j.1095-8312.2001.tb01415.x
5
BorgesM. H.FigueiredoS. G.LeprevostF. V.De LimaM. E.CordeiroM.doN.DinizM. R. V.et al. (2016). Venomous extract protein profile of Brazilian tarantula Grammostola iheringi : searching for potential biotechnological applications. J. Proteom.136, 35–47. 10.1016/j.jprot.2016.01.013
6
BrinkmanD. L.JiaX.PotriquetJ.KumarD.DashD.KvaskoffD.et al. (2015). Transcriptome and venom proteome of the box jellyfish Chironex fleckeri. BMC Genom.16:407. 10.1186/s12864-015-1568-3
7
BuchfinkB.XieC.HusonD. H. (2014). Fast and sensitive protein alignment using DIAMOND. Nat. Methods12, 59–60. 10.1038/nmeth.3176
8
CalveteJ. J. (2017). Venomics: integrative venom proteomics and beyond. Biochem. J.474, 611–634. 10.1042/BCJ20160577
9
CasewellN. R.WagstaffS. C.WusterW.CookD. A. N.BoltonF. M. S.KingS. I.et al. (2014). Medically important differences in snake venom composition are dictated by distinct postgenomic mechanisms. Proc. Natl. Acad. Sci. U.S.A.111, 9205–9210. 10.1073/pnas.1405484111
10
CasewellN. R.WüsterW.VonkF. J.HarrisonR. A.FryB. G. (2013). Complex cocktails: the evolutionary novelty of venoms. Trends Ecol. Evol.28, 219–229. 10.1016/j.tree.2012.10.020
11
ColinetD.AnselmeC.DeleuryE.ManciniD.PoulainJ.Azéma-DossatC.et al. (2014). Identification of the main venom protein components of Aphidius ervi, a parasitoid wasp of the aphid model Acyrthosiphon pisum. BMC Genom.15:342. 10.1186/1471-2164-15-342
12
DaltryJ. C.WüsterW.ThorpeR. S. (1996). Diet and snake venom evolution. Nature379, 537–540. 10.1038/379537a0
13
de KlerkE.‘t HoenP. A. C. (2015). Alternative mRNA transcription, processing, and translation: insights from RNA sequencing. Trends Genet.31, 128–139. 10.1016/j.tig.2015.01.001
14
de PlaterG.MartinR. L.MilburnP. J. (1995). A pharmacological and biochemical investigation of the venom from the platypus (Ornithorhynchus anatinus). Toxicon Off. J. Int. Soc. Toxinol.33, 157–169. 10.1016/0041-0101(94)00150-7
15
DeakF.LiuX.KhvotchevM.LiG.KavalaliE. T.SugitaS.et al. (2009). α-Latrotoxin stimulates a novel pathway of Ca2+-dependent synaptic exocytosis independent of the classical synaptic fusion machinery. J. Neurosci.29, 8639–8648. 10.1523/JNEUROSCI.0898-09.2009
16
DobinA.DavisC. A.SchlesingerF.DrenkowJ.ZaleskiC.JhaS.et al. (2013). STAR: ultrafast universal RNA-seq aligner. Bioinformatics29, 15–21. 10.1093/bioinformatics/bts635
17
DuanZ.CaoR.JiangL.LiangS. (2013). A combined de novo protein sequencing and cDNA library approach to the venomic analysis of Chinese spider Araneus ventricosus. J. Proteom.78, 416–427. 10.1016/j.jprot.2012.10.011
18
DuanZ.YanX.CaoR.LiuZ.WangX.LiangS. (2008). Proteomic analysis of Latrodectus tredecimguttatus venom for uncovering potential latrodectism-related proteins. J. Biochem. Mol. Toxicol.22, 328–336. 10.1002/jbt.20244
19
DutertreS.JinA. H.VetterI.HamiltonB.SunagarK.LavergneV.et al. (2014). Evolution of separate predation- and defence-evoked venoms in carnivorous cone snails. Nat. Commun.5:3521. 10.1038/ncomms4521
20
EscoubasP.KingG. F. (2009). Venomics as a drug discovery platform. Exp. Rev. Proteom.6, 221–224. 10.1586/epr.09.45
21
EscoubasP.SollodB.KingG. F. (2006). Venom landscapes: mining the complexity of spider venoms via a combined cDNA and mass spectrometric approach. Toxicon47, 650–663. 10.1016/j.toxicon.2006.01.018
22
FarajzadehL.HornshøjH.MomeniJ.ThomsenB.LarsenK.HedegaardJ.et al. (2013). Pairwise comparisons of ten porcine tissues identify differential transcriptional regulation at the gene, isoform, promoter and transcription start site level. Biochem. Biophys. Res. Commun.438, 346–352. 10.1016/j.bbrc.2013.07.074
23
Frenkel-MorgensternM.LacroixV.EzkurdiaI.LevinY.GabashviliA.PriluskyJ.et al. (2012). Chimeras taking shape: potential functions of proteins encoded by chimeric RNA transcripts. Genome Res.22, 1231–1242. 10.1101/gr.130062.111
24
FryB. G.RoelantsK.ChampagneD. E.ScheibH.TyndallJ. D. A.KingG. F.et al. (2009). The toxicogenomic multiverse: convergent recruitment of proteins into animal venoms. Annu. Rev. Genom. Hum. Genet.10, 483–511. 10.1146/annurev.genom.9.081307.164356
25
FryB. G.RoelantsK.WinterK.HodgsonW. C.GriesmanL.KwokH. F.et al. (2010). Novel venom proteins produced by differential domain-expression strategies in beaded lizards and gila monsters (genus Heloderma). Mol. Biol. Evol.27, 395–407. 10.1093/molbev/msp251
26
GacesaR.ChungR.DunnS. R.WestonA. J.Jaimes-BecerraA.MarquesA. C.et al. (2015). Gene duplications are extensive and contribute significantly to the toxic proteome of nematocysts isolated from Acropora digitifera (Cnidaria: Anthozoa: Scleractinia). BMC Genom. 16:774. 10.1186/s12864-015-1976-4
27
GendreauK. L.HaneyR. A.SchwagerE. E.WierschinT.StankeM.RichardsS.et al. (2017). House spider genome uncovers evolutionary shifts in the diversity and expression of black widow venom proteins associated with extreme toxicity. BMC Genom.18:178. 10.1186/s12864-017-3551-7
28
GibbsH. L.SanzL.ChiucchiJ. E.FarrellT. M.CalveteJ. J. (2011). Proteomic analysis of ontogenetic and diet-related changes in venom composition of juvenile and adult Dusky Pigmy rattlesnakes (Sistrurus miliarius barbouri). J. Proteom.74, 2169–2179. 10.1016/j.jprot.2011.06.013
29
GracyJ.Le-NguyenD.GellyJ. C.KaasQ.HeitzA.ChicheL. (2007). KNOTTIN: the knottin or inhibitor cystine knot scaffold in 2007. Nucleic Acids Res.36, D314–D319. 10.1093/nar/gkm939
30
GraveleyB. R. (2001). Alternative splicing: increasing diversity in the proteomic world. Trends Genet. TIG17, 100–107. 10.1016/S0168-9525(00)02176-4
31
GrishinE. V. (1998). Black widow spider toxins: the present and the future. Toxicon36, 1693–1701. 10.1016/S0041-0101(98)00162-7
32
HaneyR. A.AyoubN.ClarkeT. H.HayashiC. Y.GarbJ. E. (2014). Dramatic expansion of the black widow toxin arsenal uncovered by multi-tissue transcriptomics and venom proteomics. BMC Genom.15:366. 10.1186/1471-2164-15-366
33
HaneyR. A.ClarkeT. H.GadgilR.FitzpatrickR.HayashiC. Y.AyoubN. A.et al. (2016). Effects of gene duplication, positive selection, and shifts in gene expression on the evolution of the venom gland transcriptome in widow spiders. Genome Biol. Evol.8, 228–242. 10.1093/gbe/evv253
34
HargreavesA. D.SwainM. T.HegartyM. J.LoganD. W.MulleyJ. F. (2014). Restriction and recruitment—gene duplication and the origin and evolution of snake venom toxins. Genome Biol. Evol.6, 2088–2095. 10.1093/gbe/evu166
35
HeY.YuanC.ChenL.LeiM.ZellmerL.HuangH.et al. (2018). Transcriptional-Readthrough RNAs reflect the phenomenon of “A Gene Contains Gene(s)” or “Gene(s) within a Gene” in the human genome, and thus are not chimeric RNAs. Genes9:E40. 10.3390/genes9010040
36
HestandM. S.ZengZ.ColemanS. J.LiuJ.MacLeodJ. N. (2015). Tissue restricted splice junctions originate not only from tissue-specific gene loci, but gene loci with a broad pattern of expression. PLoS ONE10:e0144302. 10.1371/journal.pone.0144302
37
HilbrantM.DamenW. G. M.McGregorA. P. (2012). Evolutionary crossroads in developmental biology: the spider Parasteatoda tepidariorum. Dev. Camb. Engl.139, 2655–2662. 10.1242/dev.078204
38
HimayaS. W. A.JinA. H.DutertreS.GiacomottoJ.MohialdeenH.VetterI.et al. (2015). Comparative venomics reveals the complex prey capture strategy of the piscivorous cone snail Conus catus. J. Proteome Res.14, 4372–4381. 10.1021/acs.jproteome.5b00630
39
i5KConsortium (2013). The i5K initiative: advancing arthropod genomics for knowledge, human health, agriculture, and the environment. J. Hered.104, 595–600. 10.1093/jhered/est050
40
IsbisterG. K.GrayM. R. (2003). Latrodectism: a prospective cohort study of bites by formally identified redback spiders. Med. J. Aust.179, 88–91. 10.5694/j.1326-5377.2003.tb05499.x
41
IsbisterG. K.WhiteJ. (2004). Clinical consequences of spider bites: recent advances in our understanding. Toxicon43, 477–492. 10.1016/j.toxicon.2004.02.002
42
KimD.LangmeadB.SalzbergS. L. (2015). HISAT: a fast spliced aligner with low memory requirements. Nat. Methods12, 357–360. 10.1038/nmeth.3317
43
KingG. F.HardyM. C. (2013). Spider-venom peptides: structure, pharmacology, and potential for control of insect pests. Annu. Rev. Entomol.58, 475–496. 10.1146/annurev-ento-120811-153650
44
KitaM.BlackD. S.OhnoO.YamadaK.KigoshiH.UemuraD. (2009). Duck-billed platypus venom peptides induce Ca2+ influx in neuroblastoma cells. J. Am. Chem. Soc.131, 18038–18039. 10.1021/ja908148z
45
KordisD.GubensekF. (2000). Adaptive evolution of animal toxin multigene families. Gene261, 43–52. 10.1016/S0378-1119(00)00490-X
46
LengN.DawsonJ. A.ThomsonJ. A.RuottiV.RissmanA. I.SmitsB. M. G.et al. (2013). EBSeq: an empirical Bayes hierarchical model for inference in RNA-seq experiments. Bioinformatics29, 1035–1043. 10.1093/bioinformatics/btt087
47
LewisR. J.GarciaM. L. (2003). Therapeutic potential of venom peptides. Nat. Rev. Drug Discov.2, 790–802. 10.1038/nrd1197
48
LiW.GodzikA. (2006). Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics22, 1658–1659. 10.1093/bioinformatics/btl158
49
LiaoZ.CaoJ.LiS.YanX.HuW.HeQ.et al. (2007). Proteomic and peptidomic analysis of the venom from Chinese tarantula Chilobrachys jingzhao. Proteomics7, 1892–1907. 10.1002/pmic.200600785
50
ManiatisT.TasicB. (2002). Alternative pre-mRNA splicing and proteome expansion in metazoans. Nature418, 236–243. 10.1038/418236a
51
NaamatiG.AskenaziM.LinialM. (2009). ClanTox: a classifier of short animal toxins. Nucleic Acids Res.37, W363–W368. 10.1093/nar/gkp299
52
NetirojjanakulC.MirandaL. P. (2017). Progress and challenges in the optimization of toxin peptides for development as pain therapeutics. Curr. Opin. Chem. Biol.38, 70–79. 10.1016/j.cbpa.2017.03.004
53
NilsenT. W.GraveleyB. R. (2010). Expansion of the eukaryotic proteome by alternative splicing. Nature463, 457–463. 10.1038/nature08909
54
OliveraB. M. (1997). E.E. Just Lecture, 1996. Conus venom peptides, receptor and ion channel targets, and drug design: 50 million years of neuropharmacology. Mol. Biol. Cell8, 2101–2109. 10.1091/mbc.8.11.2101
55
PalS.GuptaR.KimH.WickramasingheP.BaubetV.ShoweL. C.et al. (2011). Alternative transcription exceeds alternative splicing in generating the transcriptome diversity of cerebellar development. Genome Res.21, 1260–1272. 10.1101/gr.120535.111
56
PalagiA.KohJ. M. S.LeblancM.WilsonD.DutertreS.KingG. F.et al. (2013). Unravelling the complex venom landscapes of lethal Australian funnel-web spiders (Hexathelidae: Atracinae) using LC-MALDI-TOF mass spectrometry. J. Proteom.80, 292–310. 10.1016/j.jprot.2013.01.002
57
PerteaM.PerteaG. M.AntonescuC. M.ChangT. C.MendellJ. T.SalzbergS. L. (2015). StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol.33, 290–295. 10.1038/nbt.3122
58
QuevillonE.SilventoinenV.PillaiS.HarteN.MulderN.ApweilerR.et al. (2005). InterProScan: protein domains identifier. Nucleic Acids Res.33, W116–W120. 10.1093/nar/gki442
59
ReyesA.HuberW. (2018). Alternative start and termination sites of transcription drive most transcript isoform differences across human tissues. Nucleic Acids Res.46, 582–592. 10.1093/nar/gkx1165
60
RobertsA.PimentelH.TrapnellC.PachterL. (2011). Identification of novel transcripts in annotated genomes using RNA-Seq. Bioinformatics27, 2325–2329. 10.1093/bioinformatics/btr355
61
RohouA.NieldJ.UshkaryovY. A. (2007). Insecticidal toxins from black widow spider venom. Toxicon49, 531–549. 10.1016/j.toxicon.2006.11.021
62
SanfilippoP.WenJ.LaiE. C. (2017). Landscape and evolution of tissue-specific alternative polyadenylation across Drosophila species. Genome Biol.18:229. 10.1186/s13059-017-1358-0
63
SanggaardK. W.BechsgaardJ. S.FangX.DuanJ.DyrlundT. F.GuptaV.et al. (2014). Spider genomes provide insight into composition and evolution of venom and silk. Nat. Commun.5:3765. 10.1038/ncomms4765
64
SanzL.GibbsH. L.MackessyS. P.CalveteJ. J. (2006). Venom proteomes of closely related Sistrurus rattlesnakes with divergent diets. J. Proteome Res.5, 2098–2112. 10.1021/pr0602500
65
SchwagerE. E.SharmaP. P.ClarkeT.LeiteD. J.WierschinT.PechmannM.et al. (2017). The house spider genome reveals an ancient whole-genome duplication during arachnid evolution. BMC Biol.15:62. 10.1186/s12915-017-0399-x
66
SchwartzE. F.MourãoC. B. F.MoreiraK. G.CamargosT. S.MortariM. R. (2012). Arthropod venoms: a vast arsenal of insecticidal neuropeptides. Biopolymers98, 385–405. 10.1002/bip.22100
67
SiigurE.AaspõlluA.SiigurJ. (2001). Sequence diversity of Vipera lebetina snake venom gland serine proteinase homologs–result of alternative-splicing or genome alteration. Gene263, 199–203. 10.1016/S0378-1119(00)00571-0
68
SilvaJ. P.SucklingJ.UshkaryovY. (2009). Penelope's web: using α-latrotoxin to untangle the mysteries of exocytosis. J. Neurochem.111, 275–290. 10.1111/j.1471-4159.2009.06329.x
69
SmithJ. J.HerzigV.KingG. F.AlewoodP. F. (2013). The insecticidal potential of venom peptides. Cell. Mol. Life Sci.70, 3665–3693. 10.1007/s00018-013-1315-3
70
SüdhofT. C. (2001). alpha-Latrotoxin and its receptors: neurexins and CIRL/latrophilins. Annu. Rev. Neurosci.24, 933–962. 10.1146/annurev.neuro.24.1.933
71
TangX.ZhangY.HuW.XuD.TaoH.YangX.et al. (2010). Molecular diversification of peptide toxins from the tarantula Haplopelma hainanum (Ornithoctonus hainana) venom based on transcriptomic, peptidomic, and genomic analyses. J. Proteome Res.9, 2550–2564. 10.1021/pr1000016
72
TorresA. M.De PlaterG. M.DoverskogM.Birinyi-StrachanL. C.NicholsonG. M.GallagherC. H.et al. (2000). Defensin-like peptide-2 from platypus venom: member of a class of peptides with a distinct structural fold. Biochem. J.348, 649–656. 10.1042/bj3480649
73
UndheimE. A. B.JonesA.ClauserK. R.HollandJ. W.PinedaS. S.KingG. F.et al. (2014). Clawing through evolution: toxin diversification and convergence in the ancient lineage chilopoda (Centipedes). Mol. Biol. Evol.31, 2124–2148. 10.1093/molbev/msu162
74
UshkaryovY. A.PetrenkoA. G.GeppertM.SüdhofT. C. (1992). Neurexins: synaptic cell surface proteins related to the alpha-latrotoxin receptor and laminin. Science257, 50–56. 10.1126/science.1621094
75
UshkaryovY. A.VolynskiK. E.AshtonA. C. (2004). The multiple actions of black widow spider toxins and their selective use in neurosecretion studies. Toxicon43, 527–542. 10.1016/j.toxicon.2004.02.008
76
VialaV. L.HildebrandD.FucaseT. M.ScianiJ. M.Prezotto-NetoJ. P.RiednerM.et al. (2015). Proteomic analysis of the rare Uracoan rattlesnake Crotalus vegrandis venom: evidence of a broad arsenal of toxins. Toxicon107, 234–251. 10.1016/j.toxicon.2015.09.023
77
WarrenW. C.HillierL. W.Marshall GravesJ. A.BirneyE.PontingC. P.GrütznerF.et al. (2008). Genome analysis of the platypus reveals unique signatures of evolution. Nature453, 175–183. 10.1038/nature06936
78
WhittingtonC. M.PapenfussA. T.BansalP.TorresA. M.WongE. S. W.DeakinJ. E.et al. (2008). Defensins and the convergent evolution of platypus and reptile venom genes. Genome Res.18, 986–994. 10.1101/gr.7149808
79
WhittingtonC. M.PapenfussA. T.LockeD. P.MardisE. R.WilsonR. K.AbubuckerS.et al. (2010). Novel venom gene discovery in the platypus. Genome Biol.11:R95. 10.1186/gb-2010-11-9-r95
80
WindleyM. J.HerzigV.DziemborowiczS. A.HardyM. C.KingG. F.NicholsonG. M. (2012). Spider-venom peptides as bioinsecticides. Toxins4, 191–227. 10.3390/toxins4030191
81
WongE. S. W.BelovK. (2012). Venom evolution through gene duplications. Gene496, 1–7. 10.1016/j.gene.2012.01.009
82
YanS.WangX. (2015). Recent advances in research on widow spider venoms and toxins. Toxins7, 5055–5067. 10.3390/toxins7124862
83
YanZ.FangQ.LiuY.XiaoS.YangL.WangF.et al. (2017). A venom serpin splicing isoform of the endoparasitoid wasp Pteromalus puparum suppresses host prophenoloxidase cascade by forming complexes with host hemolymph proteinases. J. Biol. Chem.292, 1038–1051. 10.1074/jbc.M116.739565
84
YuanC.JinQ.TangX.HuW.CaoR.YangS.et al. (2007). Proteomic and peptidomic characterization of the venom from the Chinese bird spider, ornithoctonus huwena wang. J. Proteom. Res.6, 2792–2801. 10.1021/pr0700192
Summary
Keywords
venom, transcriptome, toxins, Parasteatoda, alternative transcript
Citation
Haney RA, Matte T, Forsyth FS and Garb JE (2019) Alternative Transcription at Venom Genes and Its Role as a Complementary Mechanism for the Generation of Venom Complexity in the Common House Spider. Front. Ecol. Evol. 7:85. doi: 10.3389/fevo.2019.00085
Received
04 November 2018
Accepted
06 March 2019
Published
24 April 2019
Volume
7 - 2019
Edited by
Sebastien Dutertre, Centre National de la Recherche Scientifique (CNRS), France
Reviewed by
Jianqiang Wu, Kunming Institute of Botany (CAS), China; Marjorie A. Lienard, Broad Institute, United States
Updates

Check for updates
Copyright
© 2019 Haney, Matte, Forsyth and Garb.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Robert A. Haney robert.a.haney@gmail.com
This article was submitted to Chemical Ecology, a section of the journal Frontiers in Ecology and Evolution
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.