Abstract
Dopamine is an important central nervous system transmitter that functions through two classes of receptors (D1 and D2) to influence a diverse range of biological processes in vertebrates. With roles in regulating neural activity, behavior, and gene expression, there has been great interest in understanding the function and evolution dopamine and its receptors. In this study, we use a combination of sequence analyses, microsynteny analyses, and phylogenetic relationships to identify and characterize both the D1 (DRD1A, DRD1B, DRD1C, and DRD1E) and D2 (DRD2, DRD3, and DRD4) dopamine receptor gene families in 43 recently sequenced bird genomes representing the major ordinal lineages across the avian family tree. We show that the common ancestor of all birds possessed at least seven D1 and D2 receptors, followed by subsequent independent losses in some lineages of modern birds. Through comparisons with other vertebrate and invertebrate species we show that two of the D1 receptors, DRD1A and DRD1B, and two of the D2 receptors, DRD2 and DRD3, originated from a whole genome duplication event early in the vertebrate lineage, providing the first conclusive evidence of the origin of these highly conserved receptors. Our findings provide insight into the evolutionary development of an important modulatory component of the central nervous system in vertebrates, and will help further unravel the complex evolutionary and functional relationships among dopamine receptors.
Introduction
Dopamine (DA) is an important neurotransmitter that functions in the central nervous system of vertebrates. DA functions through two main classes of receptors, designated D1 (Class 1) and D2 (Class 2), to influence neural activity, behavior, and gene expression (Stoof and Kebabian, ; Beaulieu and Gainetdinov, ). All DA receptors are integral membrane proteins and belong to the rhodopsin family of G protein-coupled receptors, but the two classes are structurally, functionally, and genetically distinct. D1 receptors consist of a single exon open reading frame with no introns, contain a short third cytoplasmic loop and long cytoplasmic C-terminal stretch, and activate adenylyl cyclase resulting in increased cAMP levels (Callier et al., ; Le Crom et al., ; Yamamoto et al., ). Conversely, the D2 receptors possess multiple exons and introns, have a long third cytoplasmic loop and a short C-terminus stretch that remains anchored in the membrane, and reduced adenysyl cyclase activity (Callier et al., ; Le Crom et al., ). The source of DA and the expression of its associated receptors are well conserved between birds and mammals, and are important in brain modulation of many behaviors, including reproduction, learning, vocalization, addiction, and reward, making them of particular interest in both health and agricultural applications (Schnell et al., ; Chaiseha et al., ; Sasaki et al., ; Sartsoongnoen et al., ; Kubikova et al., ).
Dopamine and its receptors have been studied both functionally and evolutionarily. The D1 family is currently known to include four paralogous receptors DRD1A (human DRD1), DRD1B/X (human DRD5), DRD1C/D, and DRD1E (Callier et al., ; Yamamoto et al., ). The D2 family includes three paralogous receptors DRD2, DRD3, and DRD4 (Callier et al., ). Not all dopamine receptors are present in all species, and lineage-specific duplications have increased receptor number in some species (Yamamoto and Vernier, ). For example, only two D1 receptors DRD1A and DRD1B have been identified in mammals, while in birds a combination of DRD1A, DRD1B, DRD1C, and/or DRD1E may be present (Kubikova et al., ; Yamamoto et al., ). In the teleost fishes, DRD1B has duplicated and formed the DRD1X paralog (Yamamoto et al., ). It is commonly accepted that the two classes of vertebrate dopamine receptors are no more closely related to each other than to other classes of monoamine receptors, suggesting that they likely converged upon binding dopamine (Callier et al., ; Yamamoto and Vernier, ; Robertson et al., ; Yamamoto et al., ). However, questions still remain regarding the complex functional and evolutionary relationships between the receptors of each class.
Here, we identified and characterized D1 (DRD1A, DRD1B, DRD1C, and DRD1E) and D2 (DRD2, DRD3, and DRD4) dopamine receptor gene families in 43 recently sequenced bird genomes spanning the avian family tree at the ordinal level (Table 1) (Zhang et al., ). We used phylogenetic relationships, macro- and microsynteny, and comparisons to homologous receptors in other vertebrate genomes to make significant advances in our understanding of the evolutionary origins of D1 and D2 receptors among birds and other vertebrates.
Table 1
| Species | Common name | NCBI bioproject accession | Missing gene | Microsynteny | BLAST coverage | Citation |
|---|---|---|---|---|---|---|
| T. guttata | Zebra Finch | PRJNA17289 | DRD1E | Unidentified | N/A | Warren et al., |
| G. fortis | Medium Ground Finch | PRJNA156703 | DRD4 | Unidentified | Good | Zhang et al., |
| C. brachyrhynchos | American Crow | PRJNA212869 | N/A | N/A | N/A | Zhang et al., |
| M. vitellinus | Golden Collared Manakin | PRJNA212872 | DRD1C | Absent | Poor (30%) | Zhang et al., |
| DRD1E | Absent | Good | ||||
| A. chloris | Rifleman | PRJNA212877 | DRD4 | Unidentified | Good | Zhang et al., |
| M. undulatus | Budgerigar | PRJEB1588 | DRD1E | Absent | Poor (34%) | Ganapathy et al., |
| N. notabilis | Kea | PRJNA212900 | N/A | N/A | N/A | Zhang et al., |
| F. peregrinus | Peregrine Falcon | PRJNA159791 | N/A | N/A | N/A | Zhan et al., |
| C. cristata | Red Legged Seriema | PRJNA212889 | DRD4 | Unidentified | Good | Zhang et al., |
| M. numbicus | Northern Carmine Bee Eater | PRJNA212898 | DRD4 | Unidentified | Good | Zhang et al., |
| P. pubescens | Downy Woodpecker | PRJNA212874 | N/A | N/A | N/A | Zhang et al., |
| B. silvestris | Javan Rhinoceros Hornbill | PRJNA212887 | DRD4 | Unidentified | Good | Zhang et al., |
| A. vittatum | Bar Tailed Trogon | PRJNA212878 | DRD4 | Unidentified | Good | Zhang et al., |
| L. discolor | Cuckoo Roller | PRJNA212897 | N/A | N/A | N/A | Zhang et al., |
| C. striatus | Speckled Mousebird | PRJNA212892 | DRD1E | Unidentified | Good | Zhang et al., |
| DRD4 | Unidentified | Good | ||||
| T. alba | Barn Owl | PRJNA212909 | DRD4 | Unidentified | Good | Zhang et al., |
| H. albicilla | White Tailed Eagle | PRJNA212896 | DRD1B | Unidentified | Good | Zhang et al., |
| P. crispus | Dalmatian Pelican | PRJNA212901 | DRD4 | Unidentified | Good | Zhang et al., |
| N. nippon | Crested Ibis | PRJNA232572 | N/A | N/A | N/A | Zhang et al., |
| P. carbo | Creat Black Cormorant | PRJNA212903 | DRD1C | Absent | Poor (34%) | Zhang et al., |
| DRD4 | Unidentified | Good | ||||
| F. glacialis | Northern Fulmar | PRJNA212894 | N/A | N/A | N/A | Zhang et al., |
| P. adeliae | Adelie Penguin | PRJNA235983 | N/A | N/A | Good | Zhang et al., |
| G. stellata | Red Throated Loon | PRJNA212895 | DRD4 | Unidentified | Good | Zhang et al., |
| P. lepturus | White Tailed Tropicbird | PRJNA212902 | DRD4 | Unidentified | Good | Zhang et al., |
| E. helias | Sunbittern | PRJNA212893 | DRD4 | Unidentified | Good | Zhang et al., |
| C. vociferous | Killdeer | PRJNA212867 | DRD4 | Unidentified | Good | Zhang et al., |
| O. hoazin | Hoatzin | PRJNA212873 | DRD4 | Unidentified | Good | Zhang et al., |
| C. anna | Anna's Hummingbird | PRJNA212866 | DRD1C | Absent | Poor (15%) | Zhang et al., |
| DRD1E | Unidentified | Poor (35%) | ||||
| DRD4 | Unidentified | Good | ||||
| C. pelagica | Chimney Swift | PRJNA210808 | DRD4 | Unidentified | Good | Zhang et al., |
| A. carolinensis | Night Jar | PRJNA212888 | N/A | N/A | N/A | Zhang et al., |
| C. macqueenii | Macqueen Bustard | PRJNA212891 | DRD4 | Unidentified | Good | Zhang et al., |
| T. erythrolophus | Angola Turaco | PRJNA212908 | DRD4 | Unidentified | Good | Zhang et al., |
| M. unicolor | Brown Mesite | PRJNA212899 | DRD4 | Unidentified | Good | Zhang et al., |
| P. guturalis | Yellow Throated Sandgrouse | PRJNA212906 | DRD4 | Unidentified | Good | Zhang et al., |
| C. livia | Domestic Pigeon | PRJNA167554 | DRD4 | Unidentified | Poor (33%) | Shapiro et al., |
| P. ruber | Caribbean Flamingo | PRJNA212904 | N/A | N/A | N/A | Zhang et al., |
| P. cristatus | Great Crested Grebe | PRJNA212905 | DRD4 | Unidentified | Good | Zhang et al., |
| G. gallus | Chicken | PRJNA13342 | DRD1E | Unidentified | Good | Hillier et al., |
| M. gallopavo | Turkey | PRJNA42129 | DRD4 | Unidentified | N/A | Dalloul et al., |
| A. domestica | Peking Duck | PRJNA46621 | DRD4 | Unidentified | Good | Huang et al., |
| T. guttatus | White Throated Tinamou | PRJNA212876 | DRD1E | Unidentified | Good | Zhang et al., |
| DRD4 | Unidentified | Good | ||||
| S. camelus | Ostrich | PRJNA212875 | DRD4 | Unidentified | Good | Zhang et al., |
Avian genome information and summary of gene absence confirmation by microsynteny and BLAST searches of raw sequencing reads.
All bird species, including both scientific and common names, investigated in this study. Table is organized in order of bird phylogeny (Figure 2). Each species includes a reference to the original sequencing publication, the NCBI BioProject accession number for the raw data, and a summary of evidence supporting absence or lack of identification for each unidentified receptor. For microsynteny analyses, “Absent” refers to gene appearing deleted in microsynteny analyses, while “Unidentified” indicates sequence similarity but no intact gene annotation. BLAST coverage summarizes the extent to which raw reads covered a query receptor sequence when raw sequencing data (obtained from NCBI) was BLAST searched. “Poor” coverage (≤ 35%) suggests receptor is absent, while “good” (>35%) coverage suggests receptor may be present. In cases where a receptor is indicated as missing, but raw reads coverage is listed as “N/A,” raw reads for that species were either unavailable or corrupted and thus the analysis could not be performed on that species (i.e., Zebra Finch). Raw results and additional explanation are compiled in Supplementary File 1.
Materials and methods
Sequence data
To identify putative dopamine receptors across the avian lineage, we used the DNA and protein coding sequences of previously identified D1 and D2 receptors in Chicken, Turkey, and Zebra Finch to BLAST against 43 unmasked avian genome sequences from the Avian Phylogenomics Consortium (Jarvis et al., ; Zhang et al., ). The 43 species were chosen based on being representative of all major orders of modern birds, and having a now well-vetted species-level phylogenetic trees (Jarvis et al., ). All avian BLAST query reference sequences were obtained from the Ensembl Database v81 (Flicek et al., ). Accession numbers are listed in Table 3. BLAST searches were performed with CoGeBlast (Lyons and Freeling, ), and results were filtered based on e-value and HSP coverage to remove false positives (Schnable and Lyons, ). The remaining putative DA receptor gene protein coding sequences were exported and compiled into multi-FASTA files. To serve as an outgroup, the protein coding sequence for a Ciona D1-like receptor was acquired from GenScript (GenScript)1, and appended to the multi-FASTA (Supplementary File 4).
To infer an evolutionary model for the origin of DA receptors, DA receptor protein coding sequences were obtained for a range of vertebrate and invertebrate species, including human, mouse, chicken, turkey, peregrine falcon, alligator, fugu, lamprey, ciona, and drosophila. Protein coding sequences for chicken, turkey, human, and mouse D1 and D2 receptors, and fugu D2 receptors were obtained from Ensembl. Fugu D1 sequences were identified with CoGeBLAST using the orthologous chicken gene coding sequences as queries and confirmed via microsynteny analyses (Supplemental File 3). The remaining bird sequences were identified in this study. Alligator sequences were obtained using CoGe tool SynFind (https://genomevolution.org/CoGe/SynFind.pl) using bird receptors as queries to identify orthologous genes (and genomic regions) in alligator. Ciona D1-like was obtained from GenScript. To serve as the outgroup, Drosophila dopamine 1 receptor (Dop1) protein coding sequence was acquired from Flybase (Pierre et al., ). All coding sequences were compiled into a multi-FASTA file (Supplementary File 5) for alignment and phylogenetic analysis. Accession IDs for all sequences obtained from external resources other than CoGe may be found in Table 3.
Phylogenetic analysis
All phylogenetic analyses were generated from protein-coding sequences (Figures 1A,B, 7A). For each analysis, the respective multi-FASTA file(s) were uploaded to the iPlant Data Store (Goff et al., ). Sequences were aligned into PHYLIP Interleaved format using the iPlant Discovery Environment (DE) implementation of MUSCLE 3.8.31 (Supplementary Files 6, 7) (Edgar, ). To confirm the validity of programmatic (fully automated) alignment, the multiple sequence alignments used for construction of all phylogenies were visualized using Jalview (Waterhouse et al., ). Minor errors were observed, but manual adjustment did not create well-supported (bootstrap values) improvements to the gene-tree in regards to species-level relationships (Supplemental File 2: Figures S2A, S2B). As such, automatic alignments were used for all analyses, which also increases the reproducibility of our study. Phylogenetic analyses were performed on the aligned sequences using a customized iPlant implementation of RAxML 8.1.2 2 (“Protein-Based RAxML Analysis”) (Goff et al., ; Stamatakis, ). Best ML trees were calculated from 20 replicates using the PROTGAMMA Model and WAG AA substitution matrix. To generate a final tree with bootstrap support values, 100 bootstrap replicates were computed, using the same model as for best tree calculations, and the values were drawn onto the best tree using the PROTCAT model. Phylogenetic analyses can be regenerated from the Supplementary Material multi-FASTA files (Supplementary Files 4, 5) with the iPlant DE tools “MUSCLE 3.8.31” and “Protein-Based RAxML Analysis.” Alternately, phylogenies can be regenerated directly from the multiple sequence alignments (Supplementary Files 6, 7) using only iPlant DE tool “Protein-Based RAxML Analysis” or any RAxML 8.1.2 installation. Tree visualizations with bootstrap support values were performed using Dendroscope (Huson and Scornavacca, ), and visualizations with branch lengths using FigTree (http://tree.bio.ed.ac.uk/software/figtree/).
Figure 1
Synteny analysis
Microsynteny
Genomic microsynteny analyses were performed to confirm putative gene identity and to differentiate between absent and unannotated receptors. All microsynteny visualizations were generated using the GEvo tool, available through the comparative genomics platform CoGe (www.genomevolution.org). GEvo plots annotated gene features along chromosome panels of user-designated lengths, and uses LAST algorithms to identify regions of high sequence similarity. Regions with high sequence similarity are physically connected using colored boxes and lines. Annotated features overlapped by these regions are colored purple, so that users can infer collinear gene order of homologous gene pairs between multiple species/chromosomes. All GEvo microsynteny analyses have links provided in figure legends to regenerate the analysis and visualization.
Genome wide synteny
To identify genome wide duplication patterns and investigate the nature of the DA receptor duplications, the CoGe tool SynMap was used to generate a full-genome syntenic dot plot between Chicken and Turkey (Figure 5). Synonymous mutation (Ks) values for each syntenic gene pair were calculated using CODEML (Yang,
Raw reads BLAST search
To confirm the gene absences indicated by microsynteny analysis, BLAST searches were performed against the raw sequencing reads for target bird genomes. Raw reads were extracted from SRA files obtained from the NCBI BioProject for each avian species (Table 1), and BLAST was performed using the NCBI blastn suite (Altschul et al.,
Human ortholog comparisons
To test for possible additional evidence into the WGD origin of the DRD1A/DRD1B receptors, maps comparing human chromosome 4 (containing DRD1, the DRD1A ortholog) and chromosome 5 (containing DRD5, the DRD1B ortholog) were generated using “Paralogons in the Human Genome” tool from the Wolfe lab (http://wolfe.ucd.ie/cgi-bin/dup_528/chrom_plot) (McLysaght et al.,
Results
Identification and classification of DA receptors in bird species
Sequence comparisons (BLAST) were used to identify putative DA receptor encoding genes in the 43 sequenced bird genomes (Table 1) (Zhang et al.,
Overall topography of the receptor phylogeny was similar to previous reports, with two major monophyletic clades comprised of the two major classes of DA receptors, D1 and D2 (Figure 1). The D1 clade was broken into 4 major sub-clades, each containing exclusively DRD1A, DRD1B, DRD1C, or DRD1E receptors. Phylogenetic relationships between the receptors, (DRD1E,(DRD1C,(DRD1A,DRD1B))), were congruent with those reported by Yamamoto et al. (
The D2 clade was broken into three major sub-clades, each monophyletic for DRD2, DRD3, and DRD4 receptors. Branching relationships were congruent with previously published relationships (DRD4,(DRD2,DRD3)), with DRD4 diverging fastest, both as a family within the D2 class and as individual receptors within the DRD4 clade (Le Crom et al.,
Species relationships within each receptor clade were not congruent with the avian species tree phylogeny (Jarvis et al.,
Independent preferential loss of derived DA receptors among bird species
Not all receptors could be identified in the genome assemblies of all species of birds (Figure 2), suggesting losses could have occurred in some lineages. To test and confirm this possibility, we performed additional BLAST searches against the raw genomic sequencing reads for each missing receptor to ensure that receptor identification was not hampered by incomplete assembly of the genome in current builds (examples in Figure 3, Table 1, Supplementary File 1).
Figure 2

Receptor presence/absence by species. Presence and absence of dopamine receptors (right) across the phylogeny of the 43 investigated bird species (left; tree adapted from Jarvis et al.,
Figure 3

Dopamine receptor identification by synteny. Genes were confirmed via microsynteny analysis with known genes from Turkey or Chicken. Each comparison consists of genomic panels with the dashed line separating top and bottom strands of DNA. Gene models are represented by colored arrows above and below that dashed line, with gene symbols annotated on the bottom panel. Colored boxes drawn above gene models with lines connecting them between two genomic regions denote pair-wise regions of sequence similarity. Genes overlapped by regions of sequence similarity are colored purple; a colinear pattern of these genes is used to infer synteny. Analyses identified three situations: (A) Example of DRD1A confirmation, in American Crow and Peregrine Falcon. Confirmed genes are boxed in light blue. Note intact gene model (purple arrow), sequence similarity of genes, as well as collinear order of genes on surrounding genomic region. (B) Example of a missing gene (DRD1D), in Anna's Hummingbird. Syntenic genomic regions containing DRD1D are boxed in green and putatively absent DRD1D are indicated by a dashed red box. Note that there is no sequence similarity between syntenic regions where the gene should be present in hummingbird. (C) Example of a likely present but unannotated gene (DRD4) in Turkey. Solid red boxes indicate two present genes, and the dashed red box indicates the area with a missing gene model. Note the high level of sequence similarity between syntenic regions overlapping the missing gene (colored boxes with connecting colored wedges). Analyses can be regenerated at:(A)http://genomevolution.org/r/e7sj(B) http://genomevolution.org/r/e84r(C) http://genomevolution.org/r/e85v.
For the D1 DA receptors, DRD1A was successfully identified and classified in all investigated species (Figure 2, closed circles). DRD1B was identified in the genome builds of all species except White-Tailed Eagle. However, microsynteny alignments showed a scaffold (#35946) that contained a section of DRD1B coding sequence on its terminal end, indicative that the gene was not fully assembled following genome sequencing (Figure 2, open circle). Additionally, when DRD1B receptor coding sequences from Nightjar, Peregrine Falcon, Northern Fulmar, Medium Ground Finch, and Turkey were used as BLAST queries against the raw genomic sequencing reads from White-Tailed Eagle, matching BLAST hits covered almost the entire length of the query sequences (Table 1, Supplementary File 1). Thus, the White Tailed Eagle DRD1B is present, but lacking in annotation due to an incomplete genome assembly. DRD1C was not identified in the genome assembly of three distantly related species, golden collared manakin, great black cormorant, and Anna's hummingbird. Microsynteny analyses and BLAST searches of the raw sequencing reads suggested DRD1C was truly absent in these species (Figure 2, no circle; Table 1, Supplementary File 1). DRD1E was not found in the genome assembly builds of 7 of the 43 species, but only one (parakeet/budgerigar) appeared as a genuine loss (Figure 2, open circles; Table 1, Supplementary File 1). Overall, DRD1A and DRD1B were present in all species, while DRD1C and DRD1E were lost in some lineages independently. Notably, D1 receptor losses appear to correspond to the fastest diverging receptors, DRD1C and DRD1E (Figure 1B).
For D2 DA receptors, DRD2 and DRD3 were present in the genome builds of all avian species, but we were unable to find the DRD4 receptor in the genome builds over half of the species. However, microsynteny analysis and BLAST searches of the raw sequencing reads showed fragments of DRD4 were present in all the remaining half of the species, either in unannotated assembled regions or in unassembled regions (Figures 2, 3C, Supplementary File 1). These absences were not limited to low coverage genomes, and included high-coverage, well-assembled genomes with scaffold N50s > 1Mb [medium ground finch, Dalmatian pelican, hoatzin, Anna's hummingbird, Chimney swift, pigeon, duck, and ostrich (Zhang et al.,
D1 and D2 vertebrate dopamine receptor families expanded through whole genome duplication
As expected, genomic regions containing each receptor showed strong synteny with regions from other species that contained the orthologous receptor (e.g., Figures 3A, 4A). Interestingly, the genomic regions containing the D1 receptors also showed weak but persistent synteny signals with the genomic regions that contained the other D1 receptors (strongest between DRD1A/DRD1B), both within and across species (Figure 4, Table 2). A similar pattern of intragenomic synteny evidence was observed between chromosome regions containing DRD2 and DRD3 (Figure 4B, Table 2). This synteny signal indicated that these genomic regions, and thus these receptor sets, might have originated from a common ancestor by duplication of large chromosomal regions (e.g., polyploidy). This pattern was not observed between any Class 1 and Class 2 receptors, or between the other D2 receptor pairs (Figure 4B).
Figure 4

DA receptor microsynteny analysis. (A) Visualization of shared microsynteny between human, chicken, and turkey chromosome regions containing DRD1A (human DRD1) and DRD1B (human DRD5) receptors. Strong synteny can be seen between Turkey/Chicken DRD1A receptors and Turkey/Chicken DRD1B receptors. Human chromosome regions containing DRD1 and DRD5 receptors show clear synteny with respective orthologs containing chicken and turkey chromosome regions. Chicken DRD1A and DRD1B regions show synteny with one another, although the signal is weaker than between orthologous receptors in different species. (B) Microsynteny visualizations (left) and phylogenetic relationships (right) of all DA receptors in Chicken. Microsynteny can be observed between D1 receptors, with the strongest between DRD1A and DRD1B. No pattern of microsynteny is seen between the two classes of receptors, with weak synteny seen between DRD2 and DRD3. Details on how to read these microsynteny analyses are described in Figure 3. Results can be regenerated at (A)https://genomevolution.org/r/ho6d and (B)http://genomevolution.org/r/hjp0.
Table 2
| Microsynteny | SynFind | SynMap | Human Paralogy | |||
|---|---|---|---|---|---|---|
| 2 MB | 12 MB | Colinear | Density | |||
| DRD1A/DRD1B | Y | Y | Y | Y | Y | Y |
| DRD1A/DRD1C | Y | Y | Y | Y | Y | N |
| DRD1A/DRD1E | Y | Y | Y | Y | N | N |
| DRD1B/DRD1C | Y | Y | N | Y | Y | N |
| DRD1B/DRD1E | Y | Y | N | Y | N | N |
| DRD1C/DRD1E | Y | Y | N | N | N | N |
| DRD2/DRD3 | N | Y | N | Y | N | Y |
| DRD2/DRD4 | N | N | N | N | N | N |
| DRD3/DRD4 | N | N | N | N | N | N |
Duplication evidence summary.
Evidence supporting a WGD origin of each DA receptor pair. For the D1 receptors, evidence follows a pattern consistent with fractionation loss across all D1 receptors, but is only conclusive for DRD1A/DRD1B. For D2 receptors, evidence suggests DRD2/DRD3 arose from a WGD, but the evidence is weaker than for DRD1A/DRD1B. In no cases was evidence for duplication between classes D1 and D2 receptors found (not shown). Microsynteny analyses were performed with GEvo for two genomic windows: 2 MB and 12 MB. SynFind analyses were performed using both collinear and density based scoring algorithms, with a gene window size of 40 and minimum number of four syntenic genes. SynMap analyses aimed to identify whether gene pairs were located in chromosome regions with evidence of a WGD. GEvo, SynFind, and SynMap are all CoGe tools, are available online at: http://www.genomevolution.org (Lyons and Freeling,
Table 3
| Species | Gene name | Accession ID |
|---|---|---|
| Chicken | DRD1A | ENSEMBL: ENSGALG00000003270 |
| Chicken | DRD1B | ENSEMBL: ENSGALG00000014975 |
| Chicken | DRD2 | ENSEMBL: ENSGALG00000007794 |
| Chicken | DRD3 | ENSEMBL: ENSGALG00000015117 |
| Chicken | DRD4 | ENSEMBL: ENSGALG00000005085 |
| Turkey | DRD1A | ENSEMBL: ENSMGAG00000015442 |
| Turkey | DRD1B | ENSEMBL: ENSMGAG00000012700 |
| Turkey | DRD2 | ENSEMBL: ENSMGAG00000004223 |
| Turkey | DRD3 | ENSEMBL: ENSMGAG00000014321 |
| Turkey | DRD4 | ENSEMBL: ENSMGAG00000003742 |
| Zebra Finch | DRD1A | ENSEMBL: ENSTGUG00000000340 |
| Zebra Finch | DRD1B | ENSEMBL: ENSTGUG00000009908 |
| Zebra Finch | DRD2 | ENSEMBL: ENSTGUG00000000255 |
| Zebra Finch | DRD3 | ENSEMBL: ENSTGUG00000013405 |
| Zebra Finch | DRD4 | ENSEMBL: ENSTGUG00000007173 |
| Human | DRD1 | ENSEMBL: ENSG00000184845 |
| Human | DRD2 | ENSEMBL: ENSG00000149295 |
| Human | DRD3 | ENSEMBL: ENSG00000151577 |
| Human | DRD4 | ENSEMBL: ENSG00000069696 |
| Human | DRD5 | ENSEMBL: ENSG00000169676 |
| Mouse | DRD1A | ENSEMBL: ENSMUSG00000021478 |
| Mouse | DRD1B | ENSEMBL: ENSMUSG00000039358 |
| Mouse | DRD2 | ENSEMBL: ENSMUSG00000032259 |
| Mouse | DRD3 | ENSEMBL: ENSMUSG00000022705 |
| Mouse | DRD4 | ENSEMBL: ENSMUSG00000025496 |
| Fugu | DRD2 | ENSEMBL: ENSTRUG00000014690 |
| Fugu | DRD3 | ENSEMBL: ENSTRUG00000000584 |
| Fugu | DRD4 | ENSEMBL: ENSTRUG00000012466 |
| Lamprey | DRD1A/B | ENSEMBL: ENSPMAG00000010421 |
| Ciona | D1-Like | GenScript: XM_004226475 |
| Drosophila | Dop1 (D1-like) | Flybase: FBgn0011582 |
Reference receptor accessions.
Accession numbers for receptors sequences obtained from external resources. Coding sequence for each receptor was obtained from the listed databases.
To uncover more information on the nature of these possible duplications, we used SynMap (Lyons and Freeling,
Figure 5

Chicken/turkey synonymous dot plot. (A) Syntenic dot plot of Chicken (Y-axis) and Turkey (X-axis) genomes. Horizontal or vertical gray lines delineate chromosomes. Each dot represents an orthologous gene pair. Dots are colored based on synonymous mutation rate (Ks) as calculated between each orthologous gene pair. (B) Histogram of log10 transformed Ks values. Three major groupings of Ks values can be identified. Blue/Green is typical of orthologous genes derived from the divergence of lineages (*). Orange is algorithmic noise (**; Ks values >80 substitutions per synonymous site). Small red group is indicative of an ancient whole genome duplication event (***). DRD1A/DRD1B genes fall within this latter cluster, and are shown in the red boxes in (A). Analysis can be regenerated at https://genomevolution.org/r/drgh.
Of the D1 receptors, gene pairs DRD1A/DRD1B were located in regions identified by SynMap with evidence of originating through a WGD, DRD1A/DRD1C, and DRD1B/DRD1C were located near, but not within such regions, and WGD evidence was not present in regions containing DRD1E (Figure 5, Table 2). This pattern of conservation is consistent with the fractionation of homeologous (partially homologous) gene content that follows polyploidy, as has been shown in plant systems (Thomas et al.,
Additional evidence for the WGD origin of the DRD1A/DRD1B and DRD2/DRD3 receptors was sought through comparative analysis with the human genome, where we investigated whether the human orthologs of the avian genes were located in chromosome regions previously identified as likely derived from an ancient vertebrate WGD (McLysaght et al.,
Figure 6

Human paralagon maps. Human chromosome maps showing regions previously classified as likely having originated from a whole genome duplication event in vertebrate history. (A) Human DRD1 (light blue star) and DRD5 (dark blue star) are marked on their respective chromosomes, and can be seen to fall within these regions. (B) Human DRD2 (orange star), DRD3 (purple star), and DRD4 (red star) mapped on their respective chromosomes. DRD2/DRD3 fall within these regions. Maps were generated using “Paralogons in the Human Genome” tool from the Wolfe lab, and are based on a minimum threshold for paired proteins per block greater than 12 (A) or 7 (B). Paired proteins per block greater than 6 are statistically probable as having occurred from a WGD event rather than by chance (McLysaght et al.,
Evolutionary analysis of DRD1A/DRD1B duplication in the vertebrate linage
To determine if the WGD that gave rise to the DA receptor expansion was specific to the vertebrate lineage, we constructed a phylogeny of DA receptors across a range of vertebrate and invertebrate species rooted on an arthropod, drosophila (Figure 7A). As expected, each family of receptors formed distinct clades, each monophyletic for that family. Using our methods for gene identification, duplicates of each D1 receptors DRD1A, DRD1B, and DRD1C were identified in fugu (Figure 7A, Supplemental File 3). These duplications are likely explained by an additional WGD that occurred in the fugu lineage (Panopoulou and Poustka,
Figure 7

DRD1A/DRD1B evolutionary phylogeny and proposed model. (A) Phylogenetic relationship of D1 and D2 dopamine receptors for two invertebrate (Drosophila and Ciona D1-like) and seven vertebrates (Lamprey, Fugu, Alligator, Turkey, Chicken, Mouse, and Human) species based on protein coding sequences. Bootstrap support values are shown at nodes; branch lengths (substitutions/changes per site) are shown proportionally. DRD1C, DRD1D, and all D2 clades are collapsed for clarity, but expanded clades may be found in Supplemental File 3: Figure S5. Note that fugu has an independent whole genome duplication. (B) Illustrated model of DRD1A/DRD1B evolution, based on events inferred from the evolutionary phylogeny.
Discussion
Understanding DA and its associated receptors is critical for both health and agricultural sciences due to the significant roles they play in in behavior, learning, and general neuromodulation. In this study we aimed to expand our understanding of both the recent evolution of DA receptors across the avian lineage as well as the ancient evolution of DA receptors throughout the larger vertebrate lineage. Using a combination of phylogenetic and sequence analyses, we identified and classified the DA receptors in 43 recently sequenced bird species spanning the diversity of extant avian species. Our analyses show that the ancestral bird genome contained at least 7 DA receptors: four D1 receptors (DRD1A, DRD1B, DRD1C, and DRD1E) and three D2 receptors (DRD2, DRD3, and DRD4). Two of the D1 receptors (DRD1A, DRD1B) and two of the D2 receptors (DRD2 and DRD3) were maintained across all extant species. The other two D1 receptors, DRD1C and DRD1E, have been lost in some lineages independently, although they are still widely conserved across the lineage. The remaining D2 receptor DRD4 has been either pseudogenized and thus functionally lost in many species, or has increased sequence complexity making it difficult to computationally identify, assemble, and annotate.
When receptor losses are compared to the species phylogeny, no apparent species-level pattern of gene loss can be identified. With regards to the Class 1 receptors, these findings strongly support earlier claims that DRD1C and DRD1E receptors can be more easily lost in multiple vertebrate lineages independently (Yamamoto et al.,
In terms of evolutionary history of DA receptors, while there has been long-standing speculation that D1 receptors DRD1A and DRD1B could have originated from a WGD event, conclusive evidence has not been provided (Holland,
We hypothesize that the weaker colinear signal of homologous genes from the genomic regions containing DRD2 and DRD3 could be due to the diploidization process following polyploidy, where many duplicated homeologous genes are lost through a process known as fractionation, relatively quickly reducing the gene content of a genome to a state more similar to the pre-duplicated ancestral genome (Woodhouse et al.,
Yamamoto et al. (
In summary, our evidence suggests a large scale, retained expansion of the DA receptor family through an early vertebrate WGD event, followed by subsequent losses of the more rapidly diverging receptors in some species lineages independently, suggesting relaxed functional constraints on some receptors in the DA receptor superfamily. We propose that the early vertebrate ancestor possessed only one D1 receptor. An ancient WGD, likely one of the vertebrate-lineage-specific 1R or 2R events (Dehal and Boore,
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Statements
Acknowledgments
We are thankful for the insightful suggestions made by one of the reviewers that substantially strengthened this work. AH is funded by the UA Arizona Biological and Biomedical Sciences Program. The 43 bird genomes used in this study were kindly made available pre-release by the Avian Phylogenomics Consortium (http://avian.genomics.cn/en/); we thank co-coordinators of that project, Guojie Zhang of BGI and Tom Gilbert of the University of Copenhagen, along with co-author EJ, for early access to these data. CoGe is supported by the Gordon and Betty Moore Foundation (#3383) and the USDA (NIFA: GRANT11301623), and is Powered by iPlant (NSF DBI – 1265383). Phylogenetic analyses were done using iPlant resources (NSF DBI – 1265383). EHL is supported by NSF (IOS – 1339156). EDJ is supported by funding from HHMI.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary material
The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/article/10.3389/fnins.2015.00361
Footnotes
1.^GenScript - Make Research Easy - The leader in molecular cloning and gene synthesis, peptide synthesis, protein and antibody engineering. Available online at: http://www.genscript.com/ (Accessed December 19, 2014).
References
1
AltschulS. F.GishW.MillerW.MyersE. W.LipmanD. J. (1990). Basic local alignment search tool. J. Mol. Biol.215, 403–410. 10.1016/S0022-2836(05)80360-2
2
BeaulieuJ.-M.GainetdinovR. R. (2011). The physiology, signaling, and pharmacology of dopamine receptors. Pharmacol. Rev.63, 182–217. 10.1124/pr.110.002642
3
Ben-IsraelS.UzefovskyF.EbsteinR. P.Knafo-NoamA. (2015). Dopamine D4 receptor polymorphism and sex interact to predict children's affective knowledge. Front. Psychol.6:846. 10.3389/fpsyg.2015.00846
4
BlancG.HokampK.WolfeK. H. (2003). A recent polyploidy superimposed on older large-scale duplications in the arabidopsis genome. Genome Res.13, 137–144. 10.1101/gr.751803
5
BlancG.WolfeK. H. (2004). Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell16, 1667–1678. 10.1105/tpc.021345
6
CallierS.SnapyanM.Le CromS.ProuD.VincentJ.-D.VernierP. (2003). Evolution and cell biology of dopamine receptors in vertebrates. Biol. Cell Auspices Eur. Cell Biol. Organ.95, 489–502. 10.1016/S0248-4900(03)00089-3
7
ChaisehaY.YoungrenO.Al-ZailaieK.El HalawaniM. (2003). Expression of D1 and D2 dopamine receptors in the hypothalamus and pituitary during the turkey reproductive cycle: colocalization with vasoactive intestinal peptide. Neuroendocrinology77, 105–118. 10.1159/000068649
8
ChengF.WuJ.FangL.SunS.LiuB.LinK.et al. (2012). Biased gene fractionation and dominant gene expression among the subgenomes of Brassica rapa. PLoS ONE7:e36442. 10.1371/journal.pone.0036442
9
DalloulR. A.LongJ. A.ZiminA. V.AslamL.BealK.Ann BlombergL.et al. (2010). Multi-platform next-generation sequencing of the domestic Turkey (Meleagris gallopavo): genome assembly and analysis. PLoS Biol.8:e1000475. 10.1371/journal.pbio.1000475
10
DehalP.BooreJ. L. (2005). Two rounds of whole genome duplication in the ancestral vertebrate. PLoS Biol.3:e314. 10.1371/journal.pbio.0030314
11
EdgarR. C. (2004). MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res.32, 1792–1797. 10.1093/nar/gkh340
12
FidlerA. E.van OersK.DrentP. J.KuhnS.MuellerJ. C.KempenaersB. (2007). Drd4 gene polymorphisms are associated with personality variation in a passerine bird. Proc. R. Soc. Lond. B. Biol. Sci.274, 1685–1691. 10.1098/rspb.2007.0337
13
FlicekP.AmodeM. R.BarrellD.BealK.BillisK.BrentS.et al. (2014). Ensembl 2014. Nucleic Acids Res.42, D749–D755. 10.1093/nar/gkt1196
14
GanapathyG.HowardJ. T.WardJ. M.LiJ.LiB.LiY.et al. (2014). High-coverage sequencing and annotated assemblies of the budgerigar genome. GigaScience3:11. 10.1186/2047-217X-3-11
15
GoffS. A.VaughnM.McKayS.LyonsE.StapletonA. E.GesslerD.et al. (2011). The iPlant collaborative: cyberinfrastructure for plant biology. Plant Genet. Genomics2:34. 10.3389/fpls.2011.00034
16
HillierL. W.MillerW.BirneyE.WarrenW.HardisonR. C.PontingC. P.et al. (2004). Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature432, 695–716. 10.1038/nature03154
17
HollandP. W. H. (1999). Gene duplication: past, present and future. Semin. Cell Dev. Biol.10, 541–547. 10.1006/scdb.1999.0335
18
HuangY.LiY.BurtD. W.ChenH.ZhangY.QianW.et al. (2013). The duck genome and transcriptome provide insight into an avian influenza virus reservoir species. Nat. Genet.45, 776–783. 10.1038/ng.2657
19
HusonD. H.ScornavaccaC. (2012). Dendroscope 3: an interactive tool for rooted phylogenetic trees and networks. Syst. Biol.61, 1061–1067. 10.1093/sysbio/sys062
20
JarvisE. D.MirarabS.AbererA. J.LiB.HoudeP.LiC.et al. (2014). Whole-genome analyses resolve early branches in the tree of life of modern birds. Science346, 1320–1331. 10.1126/science.1253451
21
KreitmanM.AkashiH. (1995). Molecular evidence for natural selection. Annu. Rev. Ecol. Syst.26, 403–422. 10.1146/annurev.es.26.110195.002155
22
KubikovaL.WadaK.JarvisE. D. (2010). Dopamine receptors in a songbird brain. J. Comp. Neurol.518, 741–769. 10.1002/cne.22255
23
Le CromS.KapsimaliM.BarômeP.-O.VernierP. (2003). Dopamine receptors for every species: gene duplications and functional diversification in craniates. J. Struct. Funct. Genomics3, 161–176. 10.1023/A:1022686622752
24
LyonsE.FreelingM. (2008). How to usefully compare homologous plant genes and chromosomes as DNA sequences. Plant J.53, 661–673. 10.1111/j.1365-313X.2007.03326.x
25
McLysaghtA.HokampK.WolfeK. H. (2002). Extensive genomic duplication during early chordate evolution. Nat. Genet.31, 200–204. 10.1038/ng884
26
PanopoulouG.PoustkaA. J. (2005). Timing and mechanism of ancient vertebrate genome duplications – the adventure of a hypothesis. Trends Genet.21, 559–567. 10.1016/j.tig.2005.08.004
27
PébusqueM. J.CoulierF.BirnbaumD.PontarottiP. (1998). Ancient large-scale genome duplications: phylogenetic and linkage analyses shed light on chordate genome evolution. Mol. Biol. Evol.15, 1145–1159. 10.1093/oxfordjournals.molbev.a026022
28
PierreS. E. S.PontingL.StefancsikR.McQuiltonP. (2014). FlyBase 102—advanced approaches to interrogating FlyBase. Nucleic Acids Res.42, D780–D788. 10.1093/nar/gkt1092
29
PutnamN. H.ButtsT.FerrierD. E. K.FurlongR. F.HellstenU.KawashimaT.et al. (2008). The amphioxus genome and the evolution of the chordate karyotype. Nature453, 1064–1071. 10.1038/nature06967
30
RobertsonB.Huerta-OcampoI.EricssonJ.Stephenson-JonesM.Pérez-FernándezJ.BolamJ. P.et al. (2012). The dopamine D2 receptor gene in lamprey, its expression in the striatum and cellular effects of D2 receptor activation. PLoS ONE7:e35642. 10.1371/journal.pone.0035642
31
SankoffD.ZhengC. (2012). Fractionation, rearrangement and subgenome dominance. Bioinformatics28, i402–i408. 10.1093/bioinformatics/bts392
32
SartsoongnoenN.KosonsirilukS.PrakobsaengN.SongsermT.RozenboimI.HalawaniM. E.et al. (2008). The dopaminergic system in the brain of the native Thai chicken, Gallus domesticus: localization and differential expression across the reproductive cycle. Gen. Comp. Endocrinol.159, 107–115. 10.1016/j.ygcen.2008.08.002
33
SasakiA.SotnikovaT. D.GainetdinovR. R.JarvisE. D. (2006). Social context-dependent singing-regulated dopamine. J. Neurosci.26, 9010–9014. 10.1523/JNEUROSCI.1335-06.2006
34
SchnableJ. C.LyonsE. (2012). Comparative Genomics with Maize and other Grasses: From Genes to Genomes! Maydica, Vol. 56. Available online at: http://cra-journals.cineca.it/index.php/maydica/article/view/690 (Accessed December 19, 2014).
35
SchnableJ. C.SpringerN. M.FreelingM. (2011). Differentiation of the maize subgenomes by genome dominance and both ancient and ongoing gene loss. Proc. Natl. Acad. Sci. U.S.A.108, 4069–4074. 10.1073/pnas.1101368108
36
SchnellS. A.YouS.El HalawaniM. E. (1999). D1 and D2 dopamine receptor messenger ribonucleic acid in brain and pituitary during the reproductive cycle of the turkey hen. Biol. Reprod.60, 1378–1383. 10.1095/biolreprod60.6.1378
37
ShapiroM. D.KronenbergZ.LiC.DomyanE. T.PanH.CampbellM.et al. (2013). Genomic diversity and evolution of the head crest in the rock pigeon. Science339, 1063–1067. 10.1126/science.1230422
38
SmithS. F.SnellP.GruetznerF.BenchA. J.HaafT.MetcalfeJ. A.et al. (2002). Analyses of the extent of shared synteny and conserved gene orders between the genome of fugu rubripes and human 20q. Genome Res.12, 776–784. 10.1101/gr.221802
39
StamatakisA. (2014). RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics30, 1312–1313. 10.1093/bioinformatics/btu033
40
StoofJ. C.KebabianJ. W. (1984). Two dopamine receptors: biochemistry, physiology and pharmacology. Life Sci.35, 2281–2296. 10.1016/0024-3205(84)90519-8
41
TangH.LyonsE. (2012). Unleashing the genome of Brassica Rapa. Front. Plant Sci.3:172. 10.3389/fpls.2012.00172
42
ThomasB. C.PedersenB.FreelingM. (2006). Following tetraploidy in an Arabidopsis ancestor, genes were removed preferentially from one homeolog leaving clusters enriched in dose-sensitive genes. Genome Res.16, 934–946. 10.1101/gr.4708406
43
WangX.WangH.WangJ.SunR.WuJ.LiuS.et al. (2011). The genome of the mesopolyploid crop species Brassica rapa. Nat. Genet.43, 1035–1039. 10.1038/ng.919
44
WarrenW. C.ClaytonD. F.EllegrenH.ArnoldA. P.HillierL. W.KünstnerA.et al. (2010). The genome of a songbird. Nature464, 757–762. 10.1038/nature08819
45
WaterhouseA. M.ProcterJ. B.MartinD. M. A.ClampM.BartonG. J. (2009). Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics25, 1189–1191. 10.1093/bioinformatics/btp033
46
WolfeK. H. (2001). Yesterday's polyploids and the mystery of diploidization. Nat. Rev. Genet.2, 333–341. 10.1038/35072009
47
WoodhouseM. R.SchnableJ. C.PedersenB. S.LyonsE.LischD.SubramaniamS.et al. (2010). Following tetraploidy in maize, a short deletion mechanism removed genes preferentially from one of the two homeologs. PLoS Biol.8:e1000409. 10.1371/journal.pbio.1000409
48
YamamotoK.MirabeauO.BureauC.BlinM.Michon-CoudouelS.DemarqueM.et al. (2013). Evolution of dopamine receptor genes of the D1 class in vertebrates. Mol. Biol. Evol.30, 833–843. 10.1093/molbev/mss268
49
YamamotoK.VernierP. (2011). The evolution of dopamine systems in chordates. Front. Neuroanat.5:21. 10.3389/fnana.2011.00021
50
YangZ. (2007). PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol.24, 1586–1591. 10.1093/molbev/msm088
51
ZhanX.PanS.WangJ.DixonA.HeJ.MullerM. G.et al. (2013). Peregrine and saker falcon genome sequences provide insights into evolution of a predatory lifestyle. Nat. Genet.45, 563–566. 10.1038/ng.2588
52
ZhangG.LiC.LiQ.LiB.LarkinD. M.LeeC.et al. (2014). Comparative genomics reveals insights into avian genome evolution and adaptation. Science346, 1311–1320. 10.1126/science.1251385
Summary
Keywords
dopamine receptors, comparative genomics, genome evolution, polyploidy, whole genome duplication
Citation
Haug-Baltzell A, Jarvis ED, McCarthy FM and Lyons E (2015) Identification of dopamine receptors across the extant avian family tree and analysis with other clades uncovers a polyploid expansion among vertebrates. Front. Neurosci. 9:361. doi: 10.3389/fnins.2015.00361
Received
26 July 2015
Accepted
21 September 2015
Published
07 October 2015
Volume
9 - 2015
Edited by
Igor Ponomarev, The University of Texas at Austin, USA
Reviewed by
Philippe Vernier, Centre National de la Recherche Scientifique, France; Carolina Frankl-Vilches, Max Planck Institute for Ornithology, Germany
Updates

Check for updates
Copyright
© 2015 Haug-Baltzell, Jarvis, McCarthy and Lyons.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Eric Lyons, BIO5 Institute, School of Plant Science, University of Arizona, 1657 E. Helen St. Tucson, AZ 85745, USA ericlyons@email.arizona.edu
This article was submitted to Neurogenomics, a section of the journal Frontiers in Neuroscience
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.