Identification and Expression Analysis of G Protein-Coupled Receptors in the Miridae Insect Apolygus lucorum

G protein-coupled receptors (GPCRs) are the largest and most versatile family of transmembrane receptors in the cell and they play a vital role in the regulation of multiple physiological processes. The family Miridae (Hemiptera: Heteroptera) is one of the most diverse families of insects. Until now, information on GPCRs has been lacking in Miridae. Apolygus lucorum, a representative species of the Miridae, is an omnivorous pest that occurs worldwide and is notorious for causing serious damage to various crops and substantial economic losses. By searching the genome, 133 GPCRs were identified in A. lucorum. Compared with other model insects, we have observed GPCR genes to be remarkably expanded in A. lucorum, especially focusing on biogenic amine receptors and neuropeptide receptors. Among these, there is a novel large clade duplicated from known FMRFamide receptors (FMRFaRs). Moreover, the temporal and spatial expression profiles of the 133 genes across developmental stages were determined by transcriptome analysis. Most GPCR genes showed a low expression level in the whole organism of A. lucorum. However, there were a few highly expressed GPCR genes. The highly expressed LW opsins in the head probably relate to nocturning of A. lucorum, and the expression of Cirl at different times and in different tissues indicated it may be involved in growth and development of A. lucorum. We also found C2 leucine-rich repeat-containing GPCRs (LGRs) were mainly distributed in Hemiptera and Phthiraptera among insects. Our study was the first investigation on GPCRs in A. lucorum and it provided a molecular target for the regulation and control of Miridae pests.


INTRODUCTION
G protein-coupled receptors (GPCRs) are in a large family of protein cell surface receptors that detect molecules outside the cell and activate cellular responses (1,2). GPCRs are found only in eukaryotes, namely, yeast, choanoflagellates, and animals (3). Based on sequence homology and functional similarity, GPCRs can be grouped into six families (4,5): Family-A (rhodopsin-like); Family-B (secretin receptor family); Family-C (metabotropic glutamate/pheromone); Family-D (fungal mating pheromone receptors); Family-E (cyclic AMP receptors); and Family-F (frizzled/smoothened). These receptors are involved in a wide variety of physiological processes (6), namely, visual sensation (7), taste (8), smell sensation (9), behavioral and mood regulation (10), regulation of immune system activity and inflammation (11,12), and autonomic nervous system transmission (10,13). Because of their crucial roles in the regulation of multiple physiological processes, GPCRs are an important drug target (14) and approximately 34% (15) of all Food and Drug Administration (FDA) approved drugs target 108 members of this family.
With the continuous innovation of next-generation sequencing technology and bioinformatics, systematic identification research about GPCRs has been reported in several insects (16)(17)(18)(19)(20)(21). Among Hemiptera, it has been reported in Acyrthosiphon pisum (22), Aphis craccivora (23), Cimex lectularius (24), Diaphorina citri (25), Nilaparvata lugens (26), and Rhodnius prolixus (27). The family Miridae (Hemiptera: Heteroptera), which includes plant bugs, leaf bugs, or grass bugs, is one of the most diverse families of insects, including over 11,000 species in more than 1,300 genera (4,28). It is the largest family of true bugs belonging to Hemiptera and new members of Miridae are being described constantly. Mirids exhibit a wide range of food preferences and behaviors, including phytophagy, carnivory, and omnivory. Some mirids exhibit significant economic impacts and some are pests of food and fiber crops, whereas others are beneficial species used as biological control agents (29). Although Miridae is the largest family of Hemiptera and exhibits a complex habit, there has been little information reported on GPCRs.
Apolygus lucorum (Miridae) is an omnivorous pest that occurs worldwide and is notorious for the serious damage it causes in various crops and its substantial economic losses (30,31). Recently, the genome of A. lucorum had been reported, which provided convenient in-depth studies of this pest (32). In the present research, using bioinformatics analysis, we screened the genes encoding GPCRs from the genome of A. lucorum. The expression profiles of all GPCRs were also determined by using public transcriptome data. These results allowed us to make comparisons of GPCR systems in different insect species and to provide relevant information for further functional studies in A. lucorum. Our study was the first investigation of GPCRs in A. lucorum, which may become the basis for further investigation of the function of miridae GPCRs.

Identification of A. lucorum GPCRs
A. lucorum protein sequences were retrieved from the NCBI Genome database (https://www.ncbi.nlm.nih.gov/assembly/ GCA_009739505.2/) (32). Based on previous studies and records in Flybase (http://www.flybase.org/) (33), the GPCRs of Drosophila melanogaster (34), A. pisum (22), Bombyx mori (17), Tribolium castaneum (16), and Pediculus humanus humanus (18) were collected. By using D. melanogaster GPCRs as references and A. lucorum protein sequences as queries, BLASTP searches (35) were performed with a cut-off e-value of 1e−5 to look for all GPCR candidates. Then, seven-transmembrane (7TM) domain and annotation information was adopted as the basic criteria for all GPCR candidates. The GPCR candidates in which the number of 7TM domains was more than four or the annotation information indicated it was a GPCR were retained. The remaining GPCR candidates were also confirmed by means of BLASTX analysis in the UniProtKB/Swiss-Prot database. Using all GPCRs that we collected, pre-phylogenetic analysis with the maximum likelihood method was the final criteria to remove non-GPCRs from candidate pools. If a candidate showed fewer genetic relationships with known GPCRs by phylogenetic analysis and the hit sequences in BLASTX analysis indicated they were not GPCRs, they were classified as a non-GPCR and removed from our analysis.

Structural Analyses, Annotation Information, and Gene Locations of GPCRs
NCBI Sequence Read Archive (SRA) database (https://www.ncbi. nlm.nih.gov/sra/) (30,41), which included egg and different tissues (leg, head, body, mouthpart, wing, and gut) of nymphs and adults. Each tissue contained three biological replicates. In detail, we downloaded the SRA data first and then we used an SRA-Toolkit to split the paired-end reads. Clean reads were obtained from the raw data using Trimmomatic (42) to remove reads with quality scores lower than 10 and adapter sequences. To analyze gene expression profiles, clean reads of each sample were mapped to A. lucorum gene sets using hisat2 (43), and then the TPM value (44) of each putative GPCR gene was calculated with featureCounts (45). These TPM expression values were scaled and served to generate a cross-sample normalized trimmed mean of the M-values (TMM) gene expression matrix (46). Finally, the heatmap was drawn in ITOL (https://itol.embl. de) (47) using the normalized matrix. The value used for each sample was the mean of three independent biological replicates.

Classification of Gene Duplication Types
MCScanX (48) was used to classified the duplication types of different duplicate GPCR genes. First, the homology with different genes in the genome of A. lucorum was determined by a whole-genome BLASTP analysis with a max target seqs of 5 and a cut-off e-value of 1e−5. Then, the homology with different genes and the chromosomal location were combined and all genes were classified into various types, including the segmental duplication, and tandem duplication. Finally, the duplication types of GPCRs were extracted based on these results. All visualized works were accomplished in TBtools (49).

RESULTS
A total of 133 putative GPCRs were identified in A. lucorum. These GPCRs were classified into four families and included 98 family-A members, 21 family-B members, 10 family-C members, and four family-F members (Tables 1-4 and Table S1). Based on the protein sequences, phylogenetic trees were reconstructed for each GPCR family/subfamily of A. lucorum, R. prolixus, A. pisum, and D. melanogaster. All GPCRs were quantified with the TPM values obtained from transcriptomic data. The expression profile of each GPCR across developmental stages was also present in the phylogenetic trees of each GPCR family/ subfamily (Figures 1-3, and Figures S1-S3). The chromosomal locations of all GPCRs are shown in

Family-A GPCRs
Insect family-A GPCRs include opsins, biogenic amine receptors, neuropeptide and protein hormone receptors, and purine GPCRs (17,18,22). In this study, 98 family-A GPCRs were identified in the genome of A. lucorum, and these receptors were composed of seven opsins, 30 biogenic amine receptors, 58 neuropeptide and protein hormone receptors, and three purine GPCRs (Tables 1, 2).

Opsins
Color vision in insects is based on the expression of different opsins in photoreceptive cells. Opsins are members of the family-A GPCRs and are coupled to light-sensitive chromophores in animal photoreceptors (50). Three groups of opsins have been reported in D. melanogaster: one related to long-wavelength (LW) vision (including Rh1, Rh2, and Rh6), another group related to short-wavelength (SW) vision (Rh3, Rh4, and Rh5), and a third group including only Rh7 (34,51). A fourth group of invertebrate opsins, named pteropsins, has been found in Apis mellifera (50) and R. prolixus (27), which was missing from the genome of D. melanogaster and A. pisum.
In this study, seven putative opsins were identified in A. lucorum. The phylogenetic analysis suggested that A1 and A2 are related to the LW opsin, A3 is related to the SW opsin, A4 and A5 belong to a third group, and A6 and A7 are close to pteropsins ( Figure 1). Four groups of invertebrate opsins were also identified in the A. lucorum genome. According to the expression profile, opsins were expressed at the highest levels in the head and mouthpart tissue, which is corresponding to their biological function. Among the four types of opsins detected in A. lucorum, the A1 showed the highest expression in the head tissue of adults with a transcripts per kilobase of exon model per million mapped reads (TPM) of 28,787 ( Figure 5) followed by A3 with a TPM of 770.

Biogenic Amine Receptors
The known biogenic amines that act as ligands for GPCRs in insects contain acetylcholine, dopamine, serotonin, octopamine, and tyramine (27). Here, we identified 30 biogenic amine receptors in A. lucorum. Based on phylogenetic analysis and sequence similarity, A8-11 are receptors for acetylcholine; A12-16 are dopamine-like receptors; A17-24 are orthologs of the octopamine receptors; A26-33 and A35-36 were identified as the serotonin-like receptors; and A25 is the GPCR that could be stimulated by two structurally related endogenous ligands, octopamine and tyramine ( Figure S1). Additionally, A34 and A37 are orphan receptors of this subfamily in A. lucorum, and are orthologs of RPRC011175 and CG13579, respectively. However, two tyramine receptors (TyrR and TyrRII) are likely to be missing in all three heteropteran insects. A25 is the only tyramine receptor in A. lucorum. Compared with opsins, the expression level of biogenic amine receptors is much lower. A36 showed the highest expression in gut tissues of adults with a TPM of 11. In FlyBase (33), we found 5-HT7, the ortholog gene of A36 in D. melanogaster, was also expressed in the digestive system.
The expression levels of neuropeptide and protein hormone receptors were higher than in biogenic amine receptors. The expression of A65 (LGR) in the bodies of nymphs was the highest in this subfamily with a TPM of 41. Moreover, A50 (CCAP-R) and A86 (moody) showed high expression levels in multiple tissues (TPM >10 at least in five tissues).

Purine GPCRs
Only one receptor in this subfamily, adenosine receptor (AdoR), has been previously classified in this subfamily (17,53). Here, three putative A. lucorum GPCRs (A96-98) were identified as AdoR, whereas there was only one member in D. melanogaster and A. pisum ( Table 2). Purine GPCRs are activated by the binding of purine nucleotides or their derivatives (principally adenosine or ADP/ATP) (54,55). Duplication of AdoR suggests that purinergic neural transmission may play a more important role in A. lucorum.

Family-B GPCRs
Family-B GPCRs play vital roles in many biological processes, including growth, development, and reproduction. They are characterized by long N-terminal domains, and they form a small group of receptors that are structurally and functionally divergent from other groups of GPCRs (56). Within this family,     family-B GPCRs can be further subdivided into three subfamilies: B1-B3, which are greatly divergent in both function and structure. In total, 21 family-B GPCRs were identified from the genome of A. lucorum in this study, which consisted of nine B1 subfamily members, three B2 subfamily members, and nine B3 subfamily members ( Table 3 and Figure 3). The B1 subfamily is made of largely classical hormone receptors. It comprises three types of hormone receptors in D. melanogaster: diuretic hormone 31 receptor (DH31-R/hector), CRF-like diuretic hormone 44 (DH44-R), and pigment dispersing factor receptor (Pdfr) (57)(58)(59)(60). In our study, all three types of hormone receptors were identified. The parathyroid hormone receptor (PTHR), which is involved in the calcium and phosphate homeostasis and bone growth in vertebrates, is also a subfamily-B1 GPCR (58). There are two PTHRs in mammals, which are involved in calcium and phosphate homeostasis and bone growth (61,62). In insects, PTHR-like (PTHRL) have been identified from T. castaneum, A. mellifera, P. h. humanus, and N. lugens (18,26,57), but its counter-parts in D. melanogaster, B. mori, A. pisum, and A. gambiae are not found (17,22,34,63). T. castaneum has two distinct PTHRLs (57), N. lugens possesses a pair of homologous PTHRLs (26), and A. mellifera only has one PTHRL (57). In our study, we also identified one PTHRL, B9, which shared a low evalue (1e−104 and 1e−111) with two PTHRLs in N. lugens. These results showed that genes coding for PTHR are divergent among insects.
The B2 subfamily is characterized by a long extracellular Nterminal domain and a GPCR proteolytic site (57,58,64). Based on phylogenetic analysis and sequence similarity, three receptors (B10-12) were classified in the B2 subfamily, which correspond to a calcium-independent receptor for a-latrotoxin (Cirl), starry night (stan), and CG15744, respectively. However, the orthologs for CG11318 and CG15556 were not identified in the genome of A. lucorum.
There is only one group of receptors in the B3 subfamily; i.e., Methuselah (mth)/Methuselah-like (mthl) (57,58). This gene family is involved in the modulation of life span and stress responses. No counterpart for the mth gene family has been identified in vertebrates (57). In insects, the number in the B3  subfamily is highly variable (18,57). In D. melanogaster, this subfamily can be divided into two groups based on their structure, 12 mth ectodomain-positive members (mth, mthl2, mthl3, mthl4, mthl6, mthl7, mthl8, mthl9, mthl10, mthl11, mthl12, and mthl13) and four mth ectodomain-negative members (mthl1, mthl5, mthl14, and mthl15) (65). In our study, nine receptors (B13-21) were identified in this family. Based on phylogenetic analysis, B13 and B20 may belong to the  mth ectodomain-positive group, and others may be members of the mth ectodomain-negative group. At the mRNA level, B13, B18, and B21 showed a higher expression level than other mthl members, which indicated these three members of the B3 subfamily may play a more important role in A. lucorum.

Family-C GPCRs
Family-C GPCRs possess a large ligand-binding extracellular domain and form constitutive dimers (34,(66)(67)(68). There are three types of GPCRs in family-C, the glutamate and g-amino butyric acid (GABA-B) receptors, the bride of sevenless (bosstype) receptors, and the metabotropic glutamate (mGlu) receptors. Until now, nine family-C GPCRs from D. melanogaster and seven from A. pisum have been reported. By using these reference sequences, 10 family-C members (Table S1 nad Figure S2) of A. lucorum were identified here. Like A. pisum (22), there are two GABA-B receptors (C1 and C2) in A. lucorum. C1 shares a 78% sequence similarity with D. melanogaster GABA-B-R1, while C2 has a 44% sequence identity with D. melanogaster GABA-B-R2. The orthologous gene to D. melanogaster GABA-B-R3 has not been found in two Hemiptera insects. The boss-type receptor was first identified as a ligand for sevenless tyrosine kinase, which was involved in eye differentiation in D. melanogaster. Subsequently, boss has been implicated in the glucose-response (69,70). It has been reported in D. melanogaster (34), A. gambiae (63), A. pisum (22), and T. castaneum (16), but not in B. mori (17), A. mellifera, N. vitripennis, and P. humanus corporis. Here, C3 was identified as the orthologous gene to boss. These results indicated that boss has been randomly lost in insects during their evolutionary process. Moreover, three mGlu receptors (C4, C5, and C6) were found in A. lucorum, whereas there are only two mGlu receptors in D. melanogaster (34) and one mGlu receptor in A. pisum (22). Small expansions of A. lucorum mGlu receptors have been observed. There are some unclassified receptors in this family. C7 was the orthologous gene to smog. C8 and C9 showed 53 and 38% sequence identities with their counterparts in D. melanogaster, respectively. As shown in Figure S2, C9 showed a high expressed level in egg, adult head, and all nymph tissues, except leg, while the function of its orthologous gene is unclear. C10 was an orphan receptor that has not been reported in D. melanogaster and A. pisum (22,34).

Family-F GPCRs
Family-F GPCRs comprise the frizzled gene family and the smoothened gene (34,71). In this study, four putative A. lucorum GPCRs were identified in the frizzled/smoothened GPCR family, which are orthologous to D. melanogaster fz, fz2, fz3, and smo (Table S1 and Figure S3). Our results indicated that orthologs for D. melanogaster fz4 were missing in A. lucorum. Among family-F, F4 (smo) showed the highest expression in the egg with a TPM of 61.

DISCUSSION
In this study, we systematically identified 133 GPCRs from A. lucorum. Compared with other model insects, we also found the GPCR genes remarkably expanded among the biogenic amine receptors, neuropeptide and protein hormone receptors, and the B1 subfamily ( Table 4). Some of them had been reported in R. prolixus or C. lectularius (24,27), such as the duplications of 5-HT7, Lkr, PK1-R, ETHR, CCAP-R, NPFR, SIFaR, and DH31-R. However, missing TyrR, Tre1, TrissinR, MsR, and some orphan receptors has also been observed in the genome of A. lucorum (Tables S2, S3). All these predicted GPCRs were quantified by transcriptome data. Although most GPCR genes showed a low expression level in A. lucorum, there were a few highly expressed GPCR genes, such as the long-wavelength opsin and Cirl. By comparative analysis, we also found C2 LGR types were widely distributed in Hemiptera. All these aspects will be discussed in detail below.

GPCRs Gene Expansion Occurred in A. lucorum
Compared with other well-studied insects, we noticed that the number of genes coding for GPCRs is obviously larger than for other insects, especially expanded among the biogenic amine receptors, neuropeptide and protein hormone receptors, and B1 subfamily ( Table 4). There were 26 GPCRs duplicated in A. lucorum. Twenty-three of them were classified into the three subfamilies mentioned above. By MCScanX analysis, we found six tandem duplication events occurred among Rh6, Rh7, AstA-R, ETHR, FMRFaR, and AdoR, while most GPCR genes duplicated dispersedly. Considering the location of GPCR genes on chromosomes, it suggested that the duplication of GPCR genes mainly occurred as independent duplications and transitions ( Figure 4).
The duplicate biogenic amine receptors in A. lucorum included 5-HT1A, 5-HT1B, 5-HT2A, 5-HT2B, 5-HT7, Dop2R, mAChR-B, Oamb, Octbeta2R, and Octbeta3R. These biogenic amine receptors can regulate many behaviors including flight and fight, learning and memory, sleep and wakefulness, feeding, and social and reproductive behaviors (72)(73)(74). For example, 5-HT1A was related to locomotor activity in B. mori. Injecting the antagonist of the Bm5-HT1A receptor into larvae caused slow or weak motility, and adults had lowered courtship vitality or moving speed (75). mAChRs have also been reported to be critical in regulation of locomotory behavior in Drosophila (76). In addition, 5-HT1B mediates hemocyte phagocytosis and serotonergic signaling performs critical modulatory functions in immune systems. Moreover, 5-HT7 and Dop2R were shown   to be associated with learning ability (77,78) and octopamine receptors were required for ovulation in D. melanogaster (79). The ligands of neuropeptide and protein hormone receptors and the B1 subfamily belong to neuropeptides, which also play an important role in the regulation of development, reproduction, feeding, courtship, aggression, olfaction, locomotor activity, circadian rhythm, and many other physiological processes in insects (21,80,81). The gene expansion of these three subfamilies indicated that A. lucorum had a more complex peptidergic signaling system. Expansions of genes associated with omnivorousness and mesophyll feeding, such as those related to digestion, chemosensory perception, and detoxification, were also observed in A. lucorum (32). Gustatory receptors (Grs) and odorant receptors (Ors) are thought to be the most important chemosensory receptors. Like GPCRs, Ors, and Grs are seven-transmembrane domain receptors but belong to the chemosensory 7tm receptor superfamily (82,83). It has been suggested that the cause of gene expansion in GPCRs might be similar to that of chemosensory receptors, also to better adapt to the environment (32). A. lucorum is found in natural and agricultural ecosystems throughout the world (30), and many of them are generalists, exhibiting diverse feeding habits or preferences (e.g., feeding on leaf, stem, inflorescences, nectar, pollen, and fruit) (32). These results indicated the complex peptidergic signaling system is more favorable for A. lucorum to adapt to multiple living environments and multiple hosts.

A. lucorum Appears to Have Evolved From a Novel Large Clade of Known FMRFaRs
FMRFamide (FMRFa) is a cardioexcitatory peptide that was first isolated from the nervous system of the clam, Macrocallista nimbosa (84), and is active as a tetrapeptide only in mollusks and annelids. Since the discovery of FMRFa, peptides with extended length at the N-terminal portion have been reported, such as myosuppressin (Ms) (85). Here, nine receptors (A54-62) displayed a certainly evolutionary kinship with the FMRFaR of D. melanogaster, A. pisum, C. lectularius, and R. prolixus, while the MsR is missing in A. lucorum (Figure 2). By reconstruction of the phylogenetic tree of MsR and FMRFaR with more species (17,86), we found these receptors were closer to the known FMRFaRs (Figure 7). Among these, A54 and A55 were clustered in a single clade with the FMRFaRs that had been identified in other insects, whereas the other seven receptors are clustered on the other single clade. The most similar proteins in UniProtKB/ Swiss-Prot of these receptors are both D. melanogaster FMRFaRs (CG2114). Among these receptors, A54 was the ortholog to the insect FMRFaRs with the smallest e-value of 1.75E−130, and A55, which was adjacent located on chromosome 9, is a tandem duplication that occurred at the beginning of this receptor expansion. However, the other seven receptors (A56-62) were scattered across six chromosomes, which indicated these seven receptors might have arisen from transposition.
By searching in the genome of other heteropteran insects, we found there are only one or two FMRFaRs in each heteropteran species. We suggest A54 and A55 should be classified as FMRFaRs, while the others (A56-62) were named as FMRFaRlike for the moment. This branch might be another unknown GPCR or even contain the MsR. Recent research had found that FMRFaR stimulates intracellular calcium signaling through the IP3R and helps maintain neuronal excitability in a subset of dopaminergic neurons for positive modulation of flight bout durations (87), and the ligand can reduce spontaneous muscle contractions, such as in the intestinal muscle and the heart rate, which also have an impact on movements (88,89). A. lucorum has great flight capacity and its adults can fly 151.3 km within 48 h (90). A large FMRFaR-like branch evolved in A. lucorum may help it maintain strong flight capability.

Only a Few GPCR Genes Showed High Expression Levels in A. lucorum
In our research, only 23 GPCR genes (17.3% of all GPCR genes) were expressed highly (TPM >10) in at least one tissue, while most GPCR genes showed a low expression level in A. lucorum. This result is consistent with previous studies in which most GPCRs showed a low endogenous expression level, even at the mRNA level (91)(92)(93). Certainly, a low expression level of GPCRs does not necessarily equate to functional insignificance (94).
Here, through transcriptome analysis, we found the expression levels of opsins were higher than in other subfamilies. Opsins that originated early in metazoan evolution mediate the response to visual stimuli primarily. When stimulated by light, opsins can activate a downstream signaling cascade by conformational change (95). Most opsin genes are expressed in photoreceptors, but there are opsins expressed in other tissues, suggesting some nonvisual functions (96,97). In A. lucorum, opsin genes (A1-3) were expressed highly not only in head but also in leg, wing, and mouthpart, indicating these opsins may execute some nonvisual functions ( Figure 1). Among these, the LW opsin (A1) showed the highest expression levels in the head tissue of adults (TPM = 28,787, at least 20 times more than other GPCR; Figure 5). The peak absorbance of the LW opsin is 500-600 nm, which corresponds to yellow-green light. As night sets in, the natural ambient light is increasingly dominated by longer wavelengths (98,99). The importance of LW opsin had been reported in many nocturnal insects (100,101). The adults of the A. lucorum were mainly active from dusk to early morning (102). High expression levels of LW opsin may help the organism adapt to a low light environment. We found B10, orthologous to Cirl, is the most widely expressed GPCR gene, which can be tested in all tissues ( Figure 5). Cirl belongs to a unique branch of GPCRs and, specifically, is an adhesion GPCR (103,104). The orthologs of Cirl have been discovered in almost all animals from invertebrates to vertebrates, including humans (105). There are three homologs of Cirl in most vertebrates (Cirl-1, Cirl-2, and Cirl-3) and two in birds and worms, whereas there is only one homolog in insects-which is most homologous to vertebrate Cirl-2 (103,106). The expression pattern of insects Cirl, which had been reported to be expressed in multiple tissues (103,107), was also like vertebrate Cirl-2 (108). Although there is only one Cirl member in insect species, Cirl is still involved in multiple physiological processes, which can regulate sensory, developmental, reproductive, and immune functions in insects (104,109). Here, B10 had been detected in all transcriptomic samples and its distribution range is wider than in T. castaneum and D. melanogaster (103,106). This kind of expression pattern suggested B10 is crucial in the development of A. lucorum.

Type C2 LGRs Are Mainly Distributed in Hemiptera and Phthiraptera Insects
Within the neuropeptide and protein hormone receptor subfamily, LGRs are a distinct subgroup with important functions in development and reproduction (110). Three distinct types of LGRs have been defined based on their structural characteristics and they are distinguished by the number of leucine-rich repeat (LRR) motifs, the absence or presence of a low density lipoprotein receptor domain class A (LDLa) motif, and their type-specific hinge region. Generally, type B LGRs have about twice the number of LRRs compared to the other two types. An exclusive feature of the type C LGRs is the presence of at least one LDLa motif in the ectodomain. The more general type containing only one LDLa will be referred to as type C1, whereas type C2 contained more than one LDLa (111).
Type C2 LGRs were first discovered in echinoderms, mollusks, and in one insect species (Pediculus humanis corporis). In our study, we found it existed in all hemipteran insects that we studied. Combining recent work, we mentioned that type C2 LGRs are reported in many hemipteran insects and P. h. humanus (18,23,25,26), and are lost in other orders of insects (16,17,63,86) (Figure 6). Until now, the presence of type C2 LGRs have been found in all Hemiptera insects in which their GPCRs have been identified (22,23,25,26). Among insects, Phthiraptera is one of the orders most closely related to Hemiptera (112). Type C2 LGRs may be present in the common ancestor of these two orders. To clarify the distribution of type C2 LGRs in insects, we checked all protein sequences in the non-redundant protein sequences database (nr) of NCBI. The result showed, except for Hemiptera and Phthiraptera, type C2 LGRs were also founded in Zootermopsis nevadensis of Blattodea. In terms offunctionality, LGRs have important functions in development and reproduction. Type A LGRs and type B LGRs are stimulated by large dimeric protein hormones (110), regulating the adult eclosion of insects, and cuticle tanning (113,114), while type C1 LGRs are the receptors of insulin-like peptide 7 and insulinlike peptide 8 and they coordinate organ growth in D. melanogaster (115)(116)(117). At present, the function of type C2 LGRs is undetermined. The function of type C2 LGRs and the existence of type C2 LGRs in Phthiraptera need to be explored in future work.

DATA AVAILABILITY STATEMENT
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.