<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Archiving and Interchange DTD v2.3 20070202//EN" "archivearticle.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="data-paper">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Ecol. Evol.</journal-id>
<journal-title>Frontiers in Ecology and Evolution</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Ecol. Evol.</abbrev-journal-title>
<issn pub-type="epub">2296-701X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fevo.2020.554164</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Ecology and Evolution</subject>
<subj-group>
<subject>Data Report</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>The First Pycnogonid Draft Genome of <italic>Nymphon striatum</italic></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Jeong</surname> <given-names>Jin-Hyeop</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/690620/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Kim</surname> <given-names>Heesoo</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/762055/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Ryu</surname> <given-names>Seongho</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1122941/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Kim</surname> <given-names>Won</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c002"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/685563/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>School of Biological Sciences, Seoul National University</institution>, <addr-line>Seoul</addr-line>, <country>South Korea</country></aff>
<aff id="aff2"><sup>2</sup><institution>Soonchunhyang Institute of Medi-Bio Science, Soonchunhyang University</institution>, <addr-line>Cheonan</addr-line>, <country>South Korea</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Enrique P. Lessa, Universidad de la Rep&#x000FA;blica, Uruguay</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Jesus Lozano-Fernandez, Instituto de Biolog&#x000ED;a Evolutiva (IBE), Spain; Rosa M. Fernandez, Harvard University, United States</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Seongho Ryu <email>ryu&#x00040;sch.ac.kr</email></corresp>
<corresp id="c002">Won Kim <email>wonkim&#x00040;plaza.snu.ac.kr</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Phylogenetics, Phylogenomics, and Systematics, a section of the journal Frontiers in Ecology and Evolution</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>09</day>
<month>11</month>
<year>2020</year>
</pub-date>
<pub-date pub-type="collection">
<year>2020</year>
</pub-date>
<volume>8</volume>
<elocation-id>554164</elocation-id>
<history>
<date date-type="received">
<day>21</day>
<month>04</month>
<year>2020</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>10</month>
<year>2020</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2020 Jeong, Kim, Ryu and Kim.</copyright-statement>
<copyright-year>2020</copyright-year>
<copyright-holder>Jeong, Kim, Ryu and Kim</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license> </permissions> <kwd-group>
<kwd><italic>Nymphon striatum</italic></kwd>
<kwd>sea spider</kwd>
<kwd>Chelicerata</kwd>
<kwd>draft genome</kwd>
<kwd>PacBio sequencing</kwd>
</kwd-group>
<contract-num rid="cn001">20180430</contract-num>
<contract-num rid="cn001">20170431</contract-num>
<contract-sponsor id="cn001">Korea Institute of Marine Science and Technology promotion<named-content content-type="fundref-id">10.13039/501100011705</named-content></contract-sponsor>
<counts>
<fig-count count="1"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="35"/>
<page-count count="6"/>
<word-count count="3914"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>The Pycnogonida, or sea spiders, are essential arthropod taxa for understanding the early evolutionary history of arthropods and relationships between primary clades of this phylum. Studies based on both molecular and morphological data suggest that sea spiders have archaic origins as old as the early Cambrian. They are also considered an early-branching lineage with a deep split origin, which is implied by their incomplete early fossil records and relatively recent diversification of extant members (Dunlop and Selden, <xref ref-type="bibr" rid="B8">2009</xref>; Rota-Stabelli et al., <xref ref-type="bibr" rid="B29">2013</xref>; Sabroux et al., <xref ref-type="bibr" rid="B30">2019</xref>; Lozano-Fernandez et al., <xref ref-type="bibr" rid="B19">2020</xref>). However, their peculiar morphologies such as extremely reduced trunk and 8&#x02013;12 walking legs containing part of their digestive and reproductive organs are highly derived and very unusual among arthropods (Sabroux et al., <xref ref-type="bibr" rid="B30">2019</xref>). With a combination of the molecular phylogenetic studies using the limited numbers of marker genes, their peculiar adult and developmental morphologies had led the Cormogonida hypothesis (Zrzav&#x000FD; et al., <xref ref-type="bibr" rid="B35">1998</xref>; Giribet et al., <xref ref-type="bibr" rid="B12">2001</xref>; Dunlop and Arango, <xref ref-type="bibr" rid="B7">2005</xref>; Machner and Scholtz, <xref ref-type="bibr" rid="B20">2010</xref>), which places pycnogonids as a sister taxon to all other arthropods, as opposed to the monophyletic Chelicerata hypothesis, which recognizes their sister taxon relationship with the Euchelicerata. Unlike these molecular phylogenetics studies based on a few markers, numerous phylogenomic studies based on thoroughly sampled expressed sequences tags repeatedly designated sea spiders as basalmost chelicerates (Meusemann et al., <xref ref-type="bibr" rid="B23">2010</xref>; Regier et al., <xref ref-type="bibr" rid="B25">2010</xref>; Rehm et al., <xref ref-type="bibr" rid="B26">2011</xref>). The most recent phylogenomic researches incorporating significantly increased taxa and tested loci based on the whole-genome or transcriptome data (Sharma et al., <xref ref-type="bibr" rid="B31">2014</xref>; Ballesteros and Sharma, <xref ref-type="bibr" rid="B2">2019</xref>; Lozano-Fernandez et al., <xref ref-type="bibr" rid="B18">2019</xref>) have brought consensus on pycnogonids firmly nested in the Chelicerata.</p>
<p>These recent phylogenomic studies provided newly sequenced chelicerate genomes or transcriptomes, in addition to invaluable insights on the evolution of Euchelicerata (Sharma et al., <xref ref-type="bibr" rid="B31">2014</xref>; Ballesteros and Sharma, <xref ref-type="bibr" rid="B2">2019</xref>; Lozano-Fernandez et al., <xref ref-type="bibr" rid="B18">2019</xref>). However, studies using such an approach on the Pycnogonida are currently very limited and fail to incorporate a <italic>de novo</italic> assembled pycnogonid genome (Dietz et al., <xref ref-type="bibr" rid="B6">2019</xref>; Ballesteros et al., <xref ref-type="bibr" rid="B1">2020</xref>). This lack of pycnogonid genome assembly is also a challenging obstacle for studies on the evolution of their distinctive morphology that requires high-quality genome assemblies with reliable annotations (Garb et al., <xref ref-type="bibr" rid="B11">2018</xref>). To the best of our knowledge, no pycnogonid genome has been sequenced to date, and there are only three cases of unpublished flow cytometry reports on pycnogonid species so far (Animal Genome Size Database, <ext-link ext-link-type="uri" xlink:href="https://genomesize.com">https://genomesize.com</ext-link>). We aimed to provide an annotated pycnogonid genome sequence using the PacBio long-read <italic>de novo</italic> genome sequencing to provide useful information for understanding the evolution of unique pycnogonid morphological and developmental characteristics. Here, we report a draft of the first high quality annotated genome of a common sea spider species, <italic>Nymphon striatum</italic>, in Korean waters. This is the first <italic>de novo</italic> sequenced and assembled genome that represents the Class Pycnogonida.</p>
</sec>
<sec sec-type="results" id="s2">
<title>Results</title>
<p>The genome size of <italic>N. striatum</italic> was estimated at &#x0007E;607 Mb, while the genome survey result showed relatively high (&#x0007E;1.9%) heterozygosity ratios and prominent heterozygosity peaks clearly separated from the homozygosity peaks (<xref ref-type="supplementary-material" rid="SM2">Supplementary Figure 1</xref>). The almost equal distributions of non-duplicated (1X) and duplicated (2X) K&#x02013;mers at the homozygosity peak further supported the highly heterozygous nature of its genome (<xref ref-type="supplementary-material" rid="SM4">Supplementary Figure 3</xref>) that we speculated the pooling of <italic>N. striatum</italic> specimens was likely its main cause. The initial contig&#x02013;level genomic assembly of <italic>N. striatum</italic> was 1.26 Gb long, of which the total base length exceeded twice those of the genome survey results (<xref ref-type="table" rid="T1">Table 1A</xref>). Genome assembly assessment using Benchmarking Universal Single-Copy Orthologs (BUSCO) (Sim&#x000E3;o et al., <xref ref-type="bibr" rid="B32">2015</xref>) also showed that 53.47% of BUSCO genes from the initial assembly were duplicated, which further indicates the presence of numerous redundant haplotypic contigs (<xref ref-type="table" rid="T1">Table 1B</xref>). Therefore, these haplotypic contigs were reduced by applying Purge Haplotigs (Roach et al., <xref ref-type="bibr" rid="B27">2018</xref>) and Redundans (Pryszcz and Gabaldo&#x000F3;n, <xref ref-type="bibr" rid="B24">2016</xref>). The Purge Haplotigs-curated contig&#x02013;level draft genome demonstrated prominent improvements in its total length, which was reduced to 732.9 Mb, and contig N50 almost doubled to 360.90 Kb, with its number of contigs being reduced to 2,946 (<xref ref-type="table" rid="T1">Table 1A</xref>). Using arthropod database, the BUSCO assessment of the revised contig&#x02013;level assembly resulted in a substantial improvement: the percentage of complete, but duplicated BUSCO genes was reduced to 16.79% (<xref ref-type="table" rid="T1">Table 1B</xref> and <xref ref-type="supplementary-material" rid="SM3">Supplementary Figure 2</xref>). The K&#x02013;mer distribution analyses using the K&#x02013;mer analysis toolkit (Mapleson et al., <xref ref-type="bibr" rid="B21">2017</xref>), further found that the ratios of heterozygous haplotigs (indicated by 2X, or duplicated K&#x02013;mers) were greatly reduced at the homozygosity peak after running Purge Haplotigs (<xref ref-type="supplementary-material" rid="SM4">Supplementary Figure 3</xref>). After conducting scaffolding and gap closing, the final version of the scaffold&#x02013;level draft genome was composed of 1,638 scaffolds with scaffold N50 increased to 701.80 Kb and a minute fraction (0.04%) of N bases (<xref ref-type="table" rid="T1">Table 1A</xref>). We found that 31.9% of the genomic scaffolds were longer than 500 Kb, and among them, 159 scaffolds were over 1,000 Kb. In particular, only seven scaffolds were shorter than 10 Kb, which indicated that our genome was highly contiguous (<xref ref-type="fig" rid="F1">Figure 1A</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p> Summary of genome assembly and annotation of <italic>Nymphon striatum</italic>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th/>
<th valign="top" align="center" style="border-bottom: thin solid #000000;"><bold>Initial assembly</bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;"><bold>Curated assembly</bold></th>
<th valign="top" align="center" style="border-bottom: thin solid #000000;"><bold>Final assembly</bold></th>
</tr>
<tr>
<th valign="top" align="left" colspan="4" style="background-color:#bbbdc0"><bold>A. Summary of statistics of the genome assemblies</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Total bases (bases)</td>
<td valign="top" align="center">1,260,501,127</td>
<td valign="top" align="center">732,914,915</td>
<td valign="top" align="center">744,788,989</td>
</tr>
<tr>
<td valign="top" align="left">No. of contigs</td>
<td valign="top" align="center">8,733</td>
<td valign="top" align="center">2,946</td>
<td valign="top" align="center">2,946</td>
</tr>
<tr>
<td valign="top" align="left">Average contig length (bases)</td>
<td valign="top" align="center">144,337</td>
<td valign="top" align="center">248,783</td>
<td valign="top" align="center">248,783</td>
</tr>
<tr>
<td valign="top" align="left">Maximum contig length (bases)</td>
<td valign="top" align="center">2,477,793</td>
<td valign="top" align="center">2,479,102</td>
<td valign="top" align="center">2,479,102</td>
</tr>
<tr>
<td valign="top" align="left">Contig N50 (bp)</td>
<td valign="top" align="center">221,141</td>
<td valign="top" align="center">360,904</td>
<td valign="top" align="center">360,904</td>
</tr>
<tr>
<td valign="top" align="left">No. of scaffolds</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">1,638</td>
</tr>
<tr>
<td valign="top" align="left">Average scaffold length (bases)</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">454,694</td>
</tr>
<tr>
<td valign="top" align="left">Maximum scaffold length (bases)</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">3,927,965</td>
</tr>
<tr>
<td valign="top" align="left">Scaffold N50 (bases)</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">&#x02013;</td>
<td valign="top" align="center">701,800</td>
</tr>
<tr>
<td valign="top" align="left">N&#x00027;s (%)</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">0.04</td>
</tr>
<tr>
<td valign="top" align="left">GC ratio (%)</td>
<td valign="top" align="center">35.37</td>
<td valign="top" align="center">35.37</td>
<td valign="top" align="center">35.37</td>
</tr>
<tr>
<td valign="top" align="left" colspan="4" style="background-color:#bbbdc0"><bold>B. BUSCO validations of genome assemblies (arthropoda_odb9)</bold></td>
</tr>
<tr>
<td/>
<td valign="top" align="center"><bold>Initial assembly</bold></td>
<td valign="top" align="center"><bold>Curated assembly</bold></td>
<td valign="top" align="center"><bold>Final assembly</bold></td>
</tr>
<tr>
<td valign="top" align="left">Complete BUSCOs (C = S &#x0002B; D) (%)</td>
<td valign="top" align="center">96.53</td>
<td valign="top" align="center">95.22</td>
<td valign="top" align="center">96.53</td>
</tr>
<tr>
<td valign="top" align="left">Complete &#x00026; single-copy (S) (%)</td>
<td valign="top" align="center">43.06</td>
<td valign="top" align="center">78.42</td>
<td valign="top" align="center">78.80</td>
</tr>
<tr>
<td valign="top" align="left">Complete &#x00026; duplicated (D) (%)</td>
<td valign="top" align="center">53.47</td>
<td valign="top" align="center">16.79</td>
<td valign="top" align="center">17.73</td>
</tr>
<tr>
<td valign="top" align="left">Fragmented BUSCOs (%)</td>
<td valign="top" align="center">0.94</td>
<td valign="top" align="center">1.69</td>
<td valign="top" align="center">0.94</td>
</tr>
<tr>
<td valign="top" align="left">Missing BUSCOs (%)</td>
<td valign="top" align="center">2.53</td>
<td valign="top" align="center">3.10</td>
<td valign="top" align="center">2.53</td>
</tr>
<tr>
<td valign="top" align="left" colspan="4" style="background-color:#bbbdc0"><bold>C. Brief statistics of genomic annotations</bold></td>
</tr>
<tr>
<td valign="top" align="left">Total bases, repeat elements (bases)</td>
<td/>
<td/>
<td valign="top" align="center">52,434,830</td>
</tr>
<tr>
<td valign="top" align="left">No. of repeat elements (hits)</td>
<td/>
<td/>
<td valign="top" align="center">564,918</td>
</tr>
<tr>
<td valign="top" align="left">Genome coverage of repeats (%)</td>
<td/>
<td/>
<td valign="top" align="center">7.14</td>
</tr>
<tr>
<td valign="top" align="left">No. of predicted genes</td>
<td/>
<td/>
<td valign="top" align="center">28,539</td>
</tr>
<tr>
<td valign="top" align="left">Genome coverage of gene regions (%)</td>
<td/>
<td/>
<td valign="top" align="center">55.06</td>
</tr>
<tr>
<td valign="top" align="left" colspan="4" style="background-color:#bbbdc0"><bold>D. Gene annotations</bold></td>
</tr>
<tr>
<td valign="top" align="left">Blast hits</td>
<td/>
<td/>
<td valign="top" align="center">27,086</td>
</tr>
<tr>
<td valign="top" align="left">No hits</td>
<td/>
<td/>
<td valign="top" align="center">1,453</td>
</tr>
<tr>
<td valign="top" align="left">Average gene length (bases)</td>
<td/>
<td/>
<td valign="top" align="center">2,130</td>
</tr>
<tr>
<td valign="top" align="left">Average intron length (bases)</td>
<td/>
<td/>
<td valign="top" align="center">1,311</td>
</tr>
<tr>
<td valign="top" align="left">Average exons/gene</td>
<td/>
<td/>
<td valign="top" align="center">10.33</td>
</tr>
<tr>
<td valign="top" align="left">Average introns/gene</td>
<td/>
<td/>
<td valign="top" align="center">9.33</td>
</tr>
<tr>
<td valign="top" align="left">No. of transfer RNAs</td>
<td/>
<td/>
<td valign="top" align="center">14,247</td>
</tr>
<tr>
<td valign="top" align="left">No. of ribosomal RNAs</td>
<td/>
<td/>
<td valign="top" align="center">308</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Characteristics of the <italic>Nymphon striatum</italic> genome assembly. <bold>(A)</bold> The length distributions of the gap&#x02013;closed scaffolds; <bold>(B)</bold> <italic>ab initio</italic> predicted repetitive elements and their subclass distributions; <bold>(C)</bold> a Venn diagram of the orthologous clusters among five chelicerate species; <bold>(D)</bold> the phylogenetic relationship of <italic>N. striatum</italic> with other six ecdysozoan species; <italic>Caenorhabditis elegans</italic> was selected as an outgroup taxon for the analysis. For each node, its bootstrap support value and the posterior probability are indicated at the base of the node.</p></caption>
<graphic xlink:href="fevo-08-554164-g0001.tif"/>
</fig>
<p>An <italic>ab initio</italic> repeat element prediction resulted in a total of 7.14% of the genomic sequences annotated as repetitive sequences (<xref ref-type="table" rid="T1">Table 1C</xref>). Among these repeat elements, short interspersed nuclear elements (SINEs) accounted for 28.20% of the total length of the annotated repeat sequences, and they were recorded as the most enriched repeats from the <italic>N. striatum</italic> genome. The SINEs were followed by simple repeats (24.09%), small RNAs (21.35%), and non-SINE interspersed elements (20.82%), while satellites (0.36%) occurred the least among the categorized repeat elements (<xref ref-type="fig" rid="F1">Figure 1B</xref>). The final <italic>N. striatum</italic> genome consisted of 28,539 genes which spanned 56.01% of the total genomic length. Additionally, 14,247 transfer RNA and 308 ribosomal RNA genes were annotated (<xref ref-type="table" rid="T1">Table 1D</xref>). The summed repetitive element contents of 7.14% and gene span of 51.06% were significantly different when they were compared to those of other chelicerate genomes used in the basic comparative analyses of this current study. The ortholog analysis conducted using six ecdysozoan genomes (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table 3</xref>) resulted in 7,597 orthologous clusters and 3,888 singletons for <italic>N. striatum</italic>, with 4,493 orthologous clusters shared within four chelicerate species (<xref ref-type="fig" rid="F1">Figure 1C</xref>). Phylogenetic tree reconstruction strongly supported the earliest-branching position of <italic>N. striatum</italic> nested in the monophyletic Chelicerata (<xref ref-type="fig" rid="F1">Figure 1D</xref>) with maximum bootstrap support values (100%) and posterior probabilities (1.0). Nevertheless, more terrestrial arachnid taxa are required to be sampled in further analysis to provide a conclusion about a contentious phylogenetic position of horseshoe crabs deeply nested in the Arachnida that the recent phylogenomic studies on the Euchelicerata have debated (Sharma et al., <xref ref-type="bibr" rid="B31">2014</xref>; Ballesteros and Sharma, <xref ref-type="bibr" rid="B2">2019</xref>; Lozano-Fernandez et al., <xref ref-type="bibr" rid="B18">2019</xref>). To sum up, we suggest that the draft <italic>N. striatum</italic> genome that we assembled is nearly complete and provides the first reference genome representing pycnogonids.</p>
</sec>
<sec sec-type="materials and methods" id="s3">
<title>Materials and Methods</title>
<sec>
<title>Sample Collection and Sequencing</title>
<p>We collected 40 individuals of <italic>N. striatum</italic> at Sacheon-hang, (37.82609&#x000B0;N, 128.93379&#x000B0;E, at a depth of 32 m, on 2018.07.12., NCBI BioSample accession ID: SAMN13567730), South Korea by SCUBA diving. These 40 sea spiders were brought to the laboratory alive, and then pooled together to compensate for the small size of the organisms and to ensure that the amount and quality of extracted DNA are acceptable for PacBio sequencing. All 40 sea spider individuals were collected from a single population at the same location to minimize the heterozygosity of the sequenced genomic reads. These pooled sea spiders were then buffered with RNAlater&#x02122; (Thermo Fisher Scientific, MA, USA) and lysed using QIAzol Reagent (Qiagen, MA, USA) according to the manufacturer&#x00027;s protocols. To isolate the DNA, the lysate was centrifuged according to the QIAzol Reagent protocol. The DNA sample was then extracted from the interphase of the lysate using the MG Genomic DNA Purification kit (Macrogen Inc, Seoul, Korea). The extracted DNA sample was quantified by NanoDrop 1000 spectrometer (Qiagen, MA, USA) and qualified using a 2100 Bioanalyzer (Agilent Technologies, CA, USA).</p>
<p>We generated and sequenced two copies of genomic Illumina Paired&#x02013;end (PE) libiraries (Illumina HiSeq X Ten, read-length 151 bp, insertion size 350 bp), five different insert&#x02013;sized Mate pair libraries (Illumina NovaSeq 6000, read-length 101 bp, insertion sizes of 550 bp, 3 kb, 5 kb, 8 kb, 10 kb), and a single PacBio library (PacBio Sequel, SMRTbell 20 kb library) for <italic>N. striatum</italic> (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table 1</xref>). These libraries for Illumina and PacBio sequencing were prepared using their respective manufacturer&#x00027;s instructions. The Illumina platforms generated 136.28 Gb of genomic PE raw reads and 287.98 Gb of Mate pair raw reads, and the PacBio Sequel (Pacific Biosciences, CA, USA) platform generated 84.83 Gb of long&#x02013;read sequences with their N50 estimated as 19.43 Kb.</p>
</sec>
<sec>
<title><italic>De novo</italic> Assembly of <italic>N. striatum</italic> Genome</title>
<p><italic>De novo</italic> sequenced genomic Illumina reads were assessed using FastQC v0.11.7 (Mar&#x000E7;ais and Kingsford, <xref ref-type="bibr" rid="B22">2011</xref>) and then underwent adapter trimming and filtering (Q &#x0003E; 30) using our in&#x02013;house scripts. These genomic PE reads were subjected to the genome survey by Jellyfish v2.2.10 using its configurations of count step (-C -c 3 -s 100000000), merge step (default parameter), histo step (-h 10000000000), and <italic>k</italic>&#x02013;mer sizes (17, 21, 25 bp). The HGAP 4 (Chin et al., <xref ref-type="bibr" rid="B5">2013</xref>) assembly application was used to assemble 84.83 Gb of PacBio subreads into contigs with its default operating options for alignment, assembly, consensus, and polishing using the Arrow application. These assembled contigs were error&#x02013;corrected by mapping filtered genomic PE sequences using default parameters of Pilon v1.21, followed by additional polishing by mapping PacBio reads using SMRT Link (v6.0.0.47841) to obtain consensus genomic contig sequences. To reduce these haplotypic contigs from our initial assembly, Purge Haplotigs and the reduction module of Redundans were applied with their default operating parameters, respectively. The statistics and BUSCO assessments of the two alternatives of curated contig-level assembly were then compared. Since the Purge Haplotgis-curated contig-level genome showed superior statistics of its assemblage (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table 2A</xref>) and duplicated complete BUSCO gene contents (16.79 vs. 34.8%, <xref ref-type="supplementary-material" rid="SM1">Supplementary Table 2B</xref>) than those of Redundans curated one, it was selected for the downstream analyses. The K&#x02013;mer analysis toolkit (Mapleson et al., <xref ref-type="bibr" rid="B21">2017</xref>) was used to validate contig&#x02013;level assemblies before and after purging haplotigs by mapping the PE reads against the genome assemblies with using default parameters. The Scaffolding Pre-assembled Contigs after Extension (SSPACE, Boetzer et al., <xref ref-type="bibr" rid="B3">2010</xref>) program was used to scaffold haplotig&#x02013;purged contigs with mate pair reads of five insertion sizes. The gaps between genomic scaffolds were closed using PBJelly (English et al., <xref ref-type="bibr" rid="B9">2012</xref>) and GMcloser (Kosugi et al., <xref ref-type="bibr" rid="B17">2015</xref>). After gap closing, the scaffolds were polished once more using the SMRT Link to finalize the scaffold&#x02013;level of the draft genome of <italic>N. striatum</italic>. BUSCO v2 was applied to assess our three versions of genome assemblies at each respective step (<xref ref-type="table" rid="T1">Table 1B</xref>) using the eukaryote_odb9 and arthropoda_odb9 databases. Repetitive sequences of the draft genome were annotated <italic>ab initio</italic> and then masked for the gene annotation procedure using RepeatMasker v4.0.6 (Tarailo-Graovac and Chen, <xref ref-type="bibr" rid="B34">2009</xref>) with our customized RepBase library (based on RepBase 24.03), which is available in the online <bold>Supplementary Data</bold>.</p>
</sec>
<sec>
<title>Genomic Annotations</title>
<p>The NCBI reference genome datasets of three chelicerates (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table 3</xref>) were downloaded for the downstream steps of genomic annotations. <italic>De novo</italic> transcriptome sequencing was conducted, nevertheless the sequenced transcriptomic reads were abandoned due to their low quality that the pooling procedure of specimens potentially resulted in. Therefore, GeMoMa (Keilwagen et al., <xref ref-type="bibr" rid="B15">2018</xref>) was used to predict open reading frames with homology-based approach, based on the gap-closed <italic>N. striatum</italic> genome and transcriptome datasets of three reference chelicerate genomes. The Seqping v0.1.33 pipeline (Chan et al., <xref ref-type="bibr" rid="B4">2017</xref>) was used to conduct gene model prediction and annotation, based on the direct evidence of reference protein sets from three chelicerate genomes and <italic>ab initio</italic> predicted gene models, which were trained using the evidence. Three chelicerate protein sequences were manually curated to reduce redundant copies, and then used as a reference. Two different Seqping associated tools, GlimmerHMM and AUGUSTUS were trained using these protein reference data, and they were used to predict gene models. MAKER2 (v2.31.8) was used to combine the homology&#x02013;based and <italic>ab initio</italic> predicted gene models to construct a consensus set of annotated genes (Holt and Yandell, <xref ref-type="bibr" rid="B13">2011</xref>; Chan et al., <xref ref-type="bibr" rid="B4">2017</xref>). Then, these gene models were functionally annotated using BLAST (v2.7.1&#x0002B;) against the bioinformatics databases which were applied for the research of the <italic>Amphibalanus amphitrite</italic> reference genome (Kim et al., <xref ref-type="bibr" rid="B16">2019</xref>) (<bold>Supplementary Materials</bold>).</p>
</sec>
<sec>
<title>Orthologous Gene Families Analysis and Phylogenomic Assays</title>
<p>Six ecdysozoan reference genome d<bold>ataset</bold>s were selected and downloaded (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table 3</xref>) from NCBI for analyzing orthologous genes using OrthoMCL v2.0.9 (Fischer et al., <xref ref-type="bibr" rid="B10">2011</xref>). Datasets of non&#x02013;chelicerate species were excluded for increasing the visual legibility of the Venn diagram of analyzed orthologs (<xref ref-type="fig" rid="F1">Figure 1C</xref>). To construct phylogenetic tree, we curated 106 non&#x02013;redundant orthologous genes that were shared with all seven analyzed genomes. These orthologs were aligned and then BeforePhylo (<ext-link ext-link-type="uri" xlink:href="https://github.com/qiyunzhu/BeforePhylo">https://github.com/qiyunzhu/BeforePhylo</ext-link>) was used to concatenate them into a supermatrix. MAFFT was used to obtain an alignment of these concatenated protein sequences (Katoh et al., <xref ref-type="bibr" rid="B14">2017</xref>). To analyze phylogenetic relationship using maximum likelihood and Bayesian inference methods, RAxML 8.2.12 (Stamatakis, <xref ref-type="bibr" rid="B33">2014</xref>) and MrBayes 3.2.7 (Ronquist et al., <xref ref-type="bibr" rid="B28">2012</xref>) were used with 1,000 and 3,000,000 pseudoreplication values, respectively.</p>
</sec>
</sec>
<sec sec-type="data-availability-statement" id="s4">
<title>Data Availability Statement</title>
<p>The datasets generated for this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: NCBI under accession numbers PRJNA595812 and SAMN13567730.</p>
</sec>
<sec id="s5">
<title>Author Contributions</title>
<p>WK and SR conceived the study. J-HJ and HK performed the bioinformatics analyses and wrote the manuscript. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="s6">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack><p>We appreciate our colleague at Seoul National University, Damin Lee, about his contribution to collect and identify <italic>Nymphon striatum</italic> samples for this research.</p></ack>
<sec sec-type="supplementary-material" id="s7">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fevo.2020.554164/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fevo.2020.554164/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.DOCX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image_1.TIF" id="SM2" mimetype="image/tif" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image_2.TIF" id="SM3" mimetype="image/tif" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image_3.TIF" id="SM4" mimetype="image/tif" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image_4.TIF" id="SM5" mimetype="image/tif" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ballesteros</surname> <given-names>J. A.</given-names></name> <name><surname>Setton</surname> <given-names>V. W.</given-names></name> <name><surname>Santib&#x000E1;&#x000F1;ez L&#x000F3;pez</surname> <given-names>C. E.</given-names></name> <name><surname>Arango</surname> <given-names>C. P.</given-names></name> <name><surname>Brenneis</surname> <given-names>G.</given-names></name> <name><surname>Brix</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Phylogenomic resolution of sea spider diversification through integration of multiple data classes</article-title>. <source>Mol. Biol. Evol.</source> <fpage>mssa228</fpage>. <pub-id pub-id-type="doi">10.1093/molbev/msaa228</pub-id><pub-id pub-id-type="pmid">32915961</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ballesteros</surname> <given-names>J. A.</given-names></name> <name><surname>Sharma</surname> <given-names>P. P.</given-names></name></person-group> (<year>2019</year>). <article-title>A critical appraisal of the placement of Xiphosura (Chelicerata) with account of known sources of phylogenetic error</article-title>. <source>Syst. Biol</source>. <volume>68</volume>, <fpage>896</fpage>&#x02013;<lpage>917</lpage>. <pub-id pub-id-type="doi">10.1093/sysbio/syz011</pub-id><pub-id pub-id-type="pmid">30917194</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boetzer</surname> <given-names>M.</given-names></name> <name><surname>Henkel</surname> <given-names>C. V.</given-names></name> <name><surname>Jansen</surname> <given-names>H. J.</given-names></name> <name><surname>Butler</surname> <given-names>D.</given-names></name> <name><surname>Pirovano</surname> <given-names>W.</given-names></name></person-group> (<year>2010</year>). <article-title>Scaffolding pre-assembled contigs using SSPACE</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>578</fpage>&#x02013;<lpage>579</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btq683</pub-id><pub-id pub-id-type="pmid">21149342</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chan</surname> <given-names>K. L.</given-names></name> <name><surname>Rosli</surname> <given-names>R.</given-names></name> <name><surname>Tatarinova</surname> <given-names>T. V.</given-names></name> <name><surname>Hogan</surname> <given-names>M.</given-names></name> <name><surname>Firdaus-Raih</surname> <given-names>M.</given-names></name> <name><surname>Low</surname> <given-names>E. T. L.</given-names></name></person-group> (<year>2017</year>). <article-title>Seqping: gene prediction pipeline for plant genomes using self-training gene models and transcriptomic data</article-title>. <source>BMC Bioinform.</source> <volume>18</volume>:<fpage>29</fpage>. <pub-id pub-id-type="doi">10.1101/038018</pub-id><pub-id pub-id-type="pmid">28466793</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chin</surname> <given-names>C. S.</given-names></name> <name><surname>Alexander</surname> <given-names>D. H.</given-names></name> <name><surname>Marks</surname> <given-names>P.</given-names></name> <name><surname>Klammer</surname> <given-names>A. A.</given-names></name> <name><surname>Drake</surname> <given-names>J.</given-names></name> <name><surname>Heiner</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data</article-title>. <source>Nat. Methods</source> <volume>10</volume>:<fpage>563</fpage>. <pub-id pub-id-type="doi">10.1038/nmeth.2474</pub-id><pub-id pub-id-type="pmid">23644548</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dietz</surname> <given-names>L.</given-names></name> <name><surname>D&#x000F6;mel</surname> <given-names>J. S.</given-names></name> <name><surname>Leese</surname> <given-names>F.</given-names></name> <name><surname>Mahon</surname> <given-names>A. R.</given-names></name> <name><surname>Mayer</surname> <given-names>C.</given-names></name></person-group> (<year>2019</year>). <article-title>Phylogenomics of the longitarsal Colossendeidae: the evolutionary history of an Antarctic sea spider radiation</article-title>. <source>Mol. Phylogenet. Evol.</source> <volume>136</volume>, <fpage>206</fpage>&#x02013;<lpage>214</lpage>. <pub-id pub-id-type="doi">10.1016/j.ympev.2019.04.017</pub-id><pub-id pub-id-type="pmid">31002869</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dunlop</surname> <given-names>J. A.</given-names></name> <name><surname>Arango</surname> <given-names>C. P.</given-names></name></person-group> (<year>2005</year>). <article-title>Pycnogonid affinities: a review</article-title>. <source>J. Zool. Syst. Evol. Res.</source> <volume>43</volume>, <fpage>8</fpage>&#x02013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1111/j.1439-0469.2004.00284.x</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dunlop</surname> <given-names>J. A.</given-names></name> <name><surname>Selden</surname> <given-names>P. A.</given-names></name></person-group> (<year>2009</year>). <article-title>Calibrating the chelicerate clock: a paleontological reply to Jeyaprakash and Hoy</article-title>. <source>Exp. Appl. Acarol.</source> <volume>48</volume>:<fpage>183</fpage>. <pub-id pub-id-type="doi">10.1007/s10493-009-9247-1</pub-id><pub-id pub-id-type="pmid">19199056</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>English</surname> <given-names>A. C.</given-names></name> <name><surname>Richards</surname> <given-names>S.</given-names></name> <name><surname>Han</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>M.</given-names></name> <name><surname>Vee</surname> <given-names>V.</given-names></name> <name><surname>Qu</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Mind the gap: upgrading genomes with Pacific biosciences RS long-read sequencing technology</article-title>. <source>PLoS ONE</source> <volume>7</volume>:<fpage>e47768</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0047768</pub-id><pub-id pub-id-type="pmid">23185243</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fischer</surname> <given-names>S.</given-names></name> <name><surname>Brunk</surname> <given-names>B. P.</given-names></name> <name><surname>Chen</surname> <given-names>F.</given-names></name> <name><surname>Gao</surname> <given-names>X.</given-names></name> <name><surname>Harb</surname> <given-names>O. S.</given-names></name> <name><surname>Iodice</surname> <given-names>J. B.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Using OrthoMCL to assign proteins to OrthoMCL-DB groups or to cluster proteomes into new ortholog groups</article-title>. <source>Curr. Protoc. Bioinform.</source> <volume>35</volume>, <fpage>6</fpage>.12.1&#x02013;6.12.19. <pub-id pub-id-type="doi">10.1002/0471250953.bi0612s35</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Garb</surname> <given-names>J. E.</given-names></name> <name><surname>Sharma</surname> <given-names>P. P.</given-names></name> <name><surname>Ayoub</surname> <given-names>N. A.</given-names></name></person-group> (<year>2018</year>). <article-title>Recent progress and prospects for advancing arachnid genomics</article-title>. <source>Curr. Opin. Insect Sci.</source> <volume>25</volume>, <fpage>51</fpage>&#x02013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1016/j.cois.2017.11.005</pub-id><pub-id pub-id-type="pmid">29602362</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giribet</surname> <given-names>G.</given-names></name> <name><surname>Edgecombe</surname> <given-names>G. D.</given-names></name> <name><surname>Wheeler</surname> <given-names>W. C.</given-names></name></person-group> (<year>2001</year>). <article-title>Arthropod phylogeny based on eight molecular loci and morphology</article-title>. <source>Nature</source> <volume>413</volume>, <fpage>157</fpage>&#x02013;<lpage>161</lpage>. <pub-id pub-id-type="doi">10.1038/35093097</pub-id><pub-id pub-id-type="pmid">11557979</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holt</surname> <given-names>C.</given-names></name> <name><surname>Yandell</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects</article-title>. <source>BMC Bioinform.</source> <volume>12</volume>:<fpage>491</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-12-491</pub-id><pub-id pub-id-type="pmid">22192575</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Katoh</surname> <given-names>K.</given-names></name> <name><surname>Rozewicki</surname> <given-names>J.</given-names></name> <name><surname>Yamada</surname> <given-names>K. D.</given-names></name></person-group> (<year>2017</year>). <article-title>MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization</article-title>. <source>Brief. Bioinform</source>. <volume>20</volume>, <fpage>1160</fpage>&#x02013;<lpage>1166</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbx108</pub-id><pub-id pub-id-type="pmid">28968734</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Keilwagen</surname> <given-names>J.</given-names></name> <name><surname>Hartung</surname> <given-names>F.</given-names></name> <name><surname>Paulini</surname> <given-names>M.</given-names></name> <name><surname>Twardziok</surname> <given-names>S. O.</given-names></name> <name><surname>Grau</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>Combining RNA-seq data and homology-based gene prediction for plants, animals, and fungi</article-title>. <source>BMC Bioinform.</source> <volume>19</volume>, <fpage>1</fpage>&#x02013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1186/s12859-018-2203-5</pub-id><pub-id pub-id-type="pmid">29843602</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>J.-H.</given-names></name> <name><surname>Kim</surname> <given-names>H. K.</given-names></name> <name><surname>Kim</surname> <given-names>H.</given-names></name> <name><surname>Chan</surname> <given-names>B. K. K.</given-names></name> <name><surname>Kang</surname> <given-names>S.</given-names></name> <name><surname>Kim</surname> <given-names>W.</given-names></name></person-group> (<year>2019</year>). <article-title>Draft genome assembly of a fouling barnacle, <italic>Amphibalanus amphitrite</italic> (Darwin, 1854): the first reference genome for Thecostraca</article-title>. <source>Front. Ecol. Evol</source>. <volume>7</volume>:<fpage>465</fpage>. <pub-id pub-id-type="doi">10.3389/fevo.2019.00465</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kosugi</surname> <given-names>S.</given-names></name> <name><surname>Hirakawa</surname> <given-names>H.</given-names></name> <name><surname>Tabata</surname> <given-names>S.</given-names></name></person-group> (<year>2015</year>). <article-title>GMcloser: closing gaps in assemblies accurately with a likelihood-based selection of contig or long-read alignments</article-title>. <source>Bioinformatics</source> <volume>31</volume>, <fpage>3733</fpage>&#x02013;<lpage>3741</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btv465</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lozano-Fernandez</surname> <given-names>J.</given-names></name> <name><surname>Tanner</surname> <given-names>A. R.</given-names></name> <name><surname>Giacomelli</surname> <given-names>M.</given-names></name> <name><surname>Carton</surname> <given-names>R.</given-names></name> <name><surname>Vinther</surname> <given-names>J.</given-names></name> <name><surname>Edgecombe</surname> <given-names>G. D.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Increasing species sampling in chelicerate genomic-scale datasets provides support for monophyly of Acari and Arachnida</article-title>. <source>Nat. Commun.</source> <volume>10</volume>:<fpage>2295</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-019-12259-6</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lozano-Fernandez</surname> <given-names>J.</given-names></name> <name><surname>Tanner</surname> <given-names>A. R.</given-names></name> <name><surname>Puttick</surname> <given-names>M. N.</given-names></name> <name><surname>Vinther</surname> <given-names>J.</given-names></name> <name><surname>Edgecombe</surname> <given-names>G. D.</given-names></name> <name><surname>Pisani</surname> <given-names>D.</given-names></name></person-group> (<year>2020</year>). <article-title>A Cambrian&#x02013;Ordovician terrestrialization of arachnids</article-title>. <source>Front. Genet.</source> <volume>11</volume>:<fpage>182</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2020.00182</pub-id><pub-id pub-id-type="pmid">32218802</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Machner</surname> <given-names>J.</given-names></name> <name><surname>Scholtz</surname> <given-names>G.</given-names></name></person-group> (<year>2010</year>). <article-title>A scanning electron microscopy study of the embryonic development of <italic>Pycnogonum litorale</italic> (Arthropoda, Pycnogonida)</article-title>. <source>J. Morphol.</source> <volume>271</volume>, <fpage>1306</fpage>&#x02013;<lpage>1318</lpage>. <pub-id pub-id-type="doi">10.1002/jmor.10871</pub-id><pub-id pub-id-type="pmid">20658562</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mapleson</surname> <given-names>D.</given-names></name> <name><surname>Accinelli</surname> <given-names>G. G.</given-names></name> <name><surname>Kettleborough</surname> <given-names>G.</given-names></name> <name><surname>Wright</surname> <given-names>J.</given-names></name> <name><surname>Clavijo</surname> <given-names>B. J.</given-names></name></person-group> (<year>2017</year>). <article-title>KAT: a K&#x02013;mer analysis toolkit to quality control NGS datasets and genome assemblies</article-title>. <source>Bioinformatics</source> <volume>33</volume>, <fpage>574</fpage>&#x02013;<lpage>576</lpage>. <pub-id pub-id-type="doi">10.1101/064733</pub-id><pub-id pub-id-type="pmid">27797770</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mar&#x000E7;ais</surname> <given-names>G.</given-names></name> <name><surname>Kingsford</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <article-title>A fast, lock-free approach for efficient parallel counting of occurrences of k-mers</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>764</fpage>&#x02013;<lpage>770</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btr011</pub-id><pub-id pub-id-type="pmid">21217122</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meusemann</surname> <given-names>K.</given-names></name> <name><surname>von Reumont</surname> <given-names>B. M.</given-names></name> <name><surname>Simon</surname> <given-names>S.</given-names></name> <name><surname>Roeding</surname> <given-names>F.</given-names></name> <name><surname>Strauss</surname> <given-names>S.</given-names></name> <name><surname>K&#x000FC;ck</surname> <given-names>P.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>A phylogenomic approach to resolve the arthropod tree of life</article-title>. <source>Mol. Biol. Evol.</source> <volume>27</volume>, <fpage>2451</fpage>&#x02013;<lpage>2464</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msq130</pub-id><pub-id pub-id-type="pmid">20534705</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pryszcz</surname> <given-names>L. P.</given-names></name> <name><surname>Gabaldo&#x000F3;n</surname> <given-names>T.</given-names></name></person-group> (<year>2016</year>). <article-title>Redundans: an assembly pipeline for highly heterozygous genomes</article-title>. <source>Nucleic Acids Res.</source> <volume>44</volume>:<fpage>e113</fpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw294</pub-id><pub-id pub-id-type="pmid">27131372</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Regier</surname> <given-names>J. C.</given-names></name> <name><surname>Shultz</surname> <given-names>J. W.</given-names></name> <name><surname>Zwick</surname> <given-names>A.</given-names></name> <name><surname>Hussey</surname> <given-names>A.</given-names></name> <name><surname>Ball</surname> <given-names>B.</given-names></name> <name><surname>Wetzer</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Arthropod relationships revealed by phylogenomic analysis of nuclear protein-coding sequences</article-title>. <source>Nature</source> <volume>463</volume>, <fpage>1079</fpage>&#x02013;<lpage>1083</lpage>. <pub-id pub-id-type="doi">10.1038/nature08742</pub-id><pub-id pub-id-type="pmid">20147900</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rehm</surname> <given-names>P.</given-names></name> <name><surname>Borner</surname> <given-names>J.</given-names></name> <name><surname>Meusemann</surname> <given-names>K.</given-names></name> <name><surname>von Reumont</surname> <given-names>B. M.</given-names></name> <name><surname>Simon</surname> <given-names>S.</given-names></name> <name><surname>Hadrys</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Dating the arthropod tree based on large-scale transcriptome data</article-title>. <source>Mol. Phylogenet. Evol.</source> <volume>61</volume>, <fpage>880</fpage>&#x02013;<lpage>887</lpage>. <pub-id pub-id-type="doi">10.1016/j.ympev.2011.09.003</pub-id><pub-id pub-id-type="pmid">21945788</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roach</surname> <given-names>M. J.</given-names></name> <name><surname>Schmidt</surname> <given-names>S. A.</given-names></name> <name><surname>Borneman</surname> <given-names>A. R.</given-names></name></person-group> (<year>2018</year>). <article-title>Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies</article-title>. <source>BMC Bioinform.</source> <volume>19</volume>:<fpage>460</fpage>. <pub-id pub-id-type="doi">10.1186/s12859-018-2485-7</pub-id><pub-id pub-id-type="pmid">30497373</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ronquist</surname> <given-names>F.</given-names></name> <name><surname>Teslenko</surname> <given-names>M.</given-names></name> <name><surname>van der Mark</surname> <given-names>P.</given-names></name> <name><surname>Ayres</surname> <given-names>D. L.</given-names></name> <name><surname>Darling</surname> <given-names>A.</given-names></name> <name><surname>H&#x000F6;hna</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space</article-title>. <source>Syst. Biol.</source> <volume>61</volume>, <fpage>539</fpage>&#x02013;<lpage>542</lpage>. <pub-id pub-id-type="doi">10.1093/sysbio/sys029</pub-id><pub-id pub-id-type="pmid">22357727</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rota-Stabelli</surname> <given-names>O.</given-names></name> <name><surname>Daley</surname> <given-names>A. C.</given-names></name> <name><surname>Pisani</surname> <given-names>D.</given-names></name></person-group> (<year>2013</year>). <article-title>Molecular timetrees reveal a Cambrian colonization of land and a new scenario for ecdysozoan evolution</article-title>. <source>Curr. Biol.</source> <volume>23</volume>, <fpage>392</fpage>&#x02013;<lpage>398</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2013.01.026</pub-id><pub-id pub-id-type="pmid">23375891</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sabroux</surname> <given-names>R.</given-names></name> <name><surname>Audo</surname> <given-names>D.</given-names></name> <name><surname>Charbonnier</surname> <given-names>S.</given-names></name> <name><surname>Corbari</surname> <given-names>L.</given-names></name> <name><surname>Hassanin</surname> <given-names>A.</given-names></name></person-group> (<year>2019</year>). <article-title>150-million-year-old sea spiders (Pycnogonida: Pantopoda) of Solnhofen</article-title>. <source>J. Syst. Palaeontol.</source> <volume>17</volume>, <fpage>1927</fpage>&#x02013;<lpage>1938</lpage>. <pub-id pub-id-type="doi">10.1080/14772019.2019.1571534</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharma</surname> <given-names>P. P.</given-names></name> <name><surname>Kaluziak</surname> <given-names>S. T.</given-names></name> <name><surname>P&#x000E9;rez-Porro</surname> <given-names>A. R.</given-names></name> <name><surname>Gonz&#x000E1;lez</surname> <given-names>V. L.</given-names></name> <name><surname>Hormiga</surname> <given-names>G.</given-names></name> <name><surname>Wheeler</surname> <given-names>W. C.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Phylogenomic interrogation of Arachnida reveals systemic conflicts in phylogenetic signal</article-title>. <source>Mol. Biol. Evol.</source> <volume>31</volume>, <fpage>2963</fpage>&#x02013;<lpage>2984</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msu235</pub-id><pub-id pub-id-type="pmid">25107551</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sim&#x000E3;o</surname> <given-names>F. A.</given-names></name> <name><surname>Waterhouse</surname> <given-names>R. M.</given-names></name> <name><surname>Ioannidis</surname> <given-names>P.</given-names></name> <name><surname>Kriventseva</surname> <given-names>E. V.</given-names></name> <name><surname>Zdobnov</surname> <given-names>E. M.</given-names></name></person-group> (<year>2015</year>). <article-title>BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs</article-title>. <source>Bioinformatics</source> <volume>31</volume>, <fpage>3210</fpage>&#x02013;<lpage>3212</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btv351</pub-id><pub-id pub-id-type="pmid">26059717</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stamatakis</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>1312</fpage>&#x02013;<lpage>1313</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu033</pub-id><pub-id pub-id-type="pmid">24451623</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tarailo-Graovac</surname> <given-names>M.</given-names></name> <name><surname>Chen</surname> <given-names>N.</given-names></name></person-group> (<year>2009</year>). <article-title>Using RepeatMasker to identify repetitive elements in genomic sequences</article-title>. <source>Curr. Protoc. Bioinform</source>. <volume>25</volume>, <fpage>4</fpage>&#x02013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1002/0471250953.bi0410s25</pub-id><pub-id pub-id-type="pmid">19274634</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Zrzav&#x000FD;</surname> <given-names>J.</given-names></name> <name><surname>Hyp&#x00161;a</surname> <given-names>V.</given-names></name> <name><surname>Vl&#x000E1;&#x00161;kov&#x000E1;</surname> <given-names>M.</given-names></name></person-group> (<year>1998</year>). <article-title>&#x0201C;Arthropod phylogeny: taxonomic congruence, total evidence and conditional combination approaches to morphological and molecular data sets,&#x0201D;</article-title> in <source>Arthropod Relationships. The Systematics Association Special Volume Series</source>, <volume>Vol. 55</volume>, eds R. A. Fortey, and R. H. Thomas (<publisher-loc>Dordrecht</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>97</fpage>&#x02013;<lpage>107</lpage>.</citation></ref>
</ref-list>
<fn-group>
<fn fn-type="financial-disclosure"><p><bold>Funding.</bold> This research was supported by the Collaborative Genome Program and the Marine Biotechnology Program of the Korea Institute of Marine Science and Technology Promotion (KIMST) funded by the Ministry of Ocean and Fisheries (MOF) (Nos. 20180430 and 20170431) Korea.</p></fn>
</fn-group>
</back>
</article>