<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article
  PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2019.00708</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Revisiting the Phylogenetic History of Helminths Through Genomics, the Case of the New <italic>Echinococcus oligarthrus</italic> Genome</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Maldonado</surname>
<given-names>Lucas L.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/714235"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Arrabal</surname>
<given-names>Juan Pablo</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Rosenzvit</surname>
<given-names>Mara Cecilia</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/634836"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Oliveira</surname>
<given-names>Guilherme Corr&#xea;a De</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/38463"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Kamenetzky</surname>
<given-names>Laura</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/633904"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup><institution>IMPaM, CONICET, Facultad de Medicina, Universidad de Buenos Aires</institution>, <addr-line>Buenos Aires</addr-line>, <country>Argentina</country></aff>
<aff id="aff2">
<sup>2</sup><institution>INMet, Instituto Nacional de Medicina Tropical</institution>, <addr-line>Puerto Iguaz&#xfa;</addr-line>, <country>Argentina</country></aff>
<aff id="aff3">
<sup>3</sup><institution>Instituto Tecnol&#xf3;gico Vale</institution>, <addr-line>Bel&#xe9;m</addr-line>, <country>Brazil</country></aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by: Gabriel Rinaldi, Wellcome Trust Sanger Institute (WT), United Kingdom</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by: Luis Carlos Guimar&#xe3;es, Federal University of Par&#xe1;, Brazil; Isheng Jason Tsai, Biodiversity Research Center,Academia Sinica, Taiwan</p>
</fn>
<fn fn-type="corresp" id="fn001">
<p>*Correspondence: Laura Kamenetzky, <email xlink:href="mailto:lkamenetzky@fmed.uba.ar">lkamenetzky@fmed.uba.ar</email>; Lucas L. Maldonado, <email xlink:href="mailto:lucas.l.maldonado@gmail.com">lucas.l.maldonado@gmail.com</email>
</p>
</fn>
<fn fn-type="other" id="fn002">
<p>This article was submitted to Evolutionary and Genomic Microbiology, a section of the journal Frontiers in Genetics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>07</day>
<month>08</month>
<year>2019</year>
</pub-date>
<pub-date pub-type="collection">
<year>2019</year>
</pub-date>
<volume>10</volume>
<elocation-id>708</elocation-id>
<history>
<date date-type="received">
<day>29</day>
<month>03</month>
<year>2019</year>
</date>
<date date-type="accepted">
<day>04</day>
<month>07</month>
<year>2019</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2019 Maldonado, Arrabal, Rosenzvit, Oliveira and Kamenetzky</copyright-statement>
<copyright-year>2019</copyright-year>
<copyright-holder>Maldonado, Arrabal, Rosenzvit, Oliveira and Kamenetzky</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>The first parasitic helminth genome sequence was published in 2007; since then, only &#x223c;200&#xa0;genomes have become available, most of them being draft assemblies. Nevertheless, despite the medical and economical global impact of helminthic infections, parasite genomes in public databases are underrepresented. Recently, through an integrative approach involving morphological, genetic, and ecological aspects, we have demonstrated that the complete life cycle of <italic>Echinococcus oligarthrus</italic> (Cestoda: Taeniidae) is present in South America. The neotropical <italic>E. oligarthrus</italic> parasite is capable of developing in any felid species and producing human infections. Neotropical echinococcosis is poorly understood yet and requires a complex medical examination to provide the appropriate intervention. Only a few cases of echinococcosis have been unequivocally identified and reported as a consequence of <italic>E. oligarthrus</italic> infections. Regarding phylogenetics, the analyses of mitogenomes and nuclear datasets have resulted in discordant topologies, and there is no unequivocal taxonomic classification of <italic>Echinococcus</italic> species so far. In this work, we sequenced and assembled the genome of <italic>E. oligarthrus</italic> that was isolated from agoutis (<italic>Dasyprocta azarae</italic>) naturally infected and performed the first comparative genomic study of a neotropical <italic>Echinococcus</italic> species. The <italic>E. oligarthrus</italic> genome assembly consisted of 86.22 Mb which showed &#x223c;90% identity and 76.3% coverage with <italic>Echinococcus multilocularis</italic> and contained the 85.0% of the total expected genes. Genetic variants analysis of whole genome revealed a higher rate of intraspecific genetic variability (23,301 SNPs; 0.22 SNPs/kb) rather than for the genomes of <italic>E. multilocularis</italic> and <italic>Echinococcus canadensis</italic> G7 but lower with respect to <italic>Echinococcus granulosus</italic> G1. Comparative genomics against <italic>E. multilocularis</italic>, <italic>E. granulosus</italic> G1, and <italic>E. canadensis</italic> G7 revealed 38,762, 125,147, and 170,049 homozygous polymorphic sites, respectively, indicating a higher genetic distance between <italic>E. oligarthrus</italic> and <italic>E. granulosus sensu lato</italic> species. The SNP distribution in chromosomes revealed a higher SNP density in the longest chromosomes. Phylogenetic analysis using whole-genome SNPs demonstrated that <italic>E. oligarthrus</italic> is one of the basal species of the genus <italic>Echinococcus</italic> and is phylogenetically closer to <italic>E. multilocularis</italic>. This work sheds light on the <italic>Echinococcus</italic> phylogeny and settles the basis to study sylvatic <italic>Echinococcus</italic> species and their developmental evolutionary features.</p>
</abstract>
<kwd-group>
<kwd><italic>Echinococcus oligarthrus</italic></kwd>
<kwd>genome</kwd>
<kwd>phylogeny</kwd>
<kwd>single nucleotide polymorphism</kwd>
<kwd>chromosomes</kwd>
<kwd>parasites</kwd>
</kwd-group>
<contract-num rid="cn001">MinCyTCAPES BR/RED 1413 (L.K.)</contract-num>
<contract-sponsor id="cn001">Consejo Nacional de Investigaciones Cient&#xed;ficas y T&#xe9;cnicas<named-content content-type="fundref-id">10.13039/501100002923</named-content>
</contract-sponsor>
<contract-sponsor id="cn002">Ministerio de Ciencia y Tecnolog&#xed;a<named-content content-type="fundref-id">10.13039/501100006280</named-content>
</contract-sponsor>
<contract-sponsor id="cn003">Ministerio de Educaci&#xf3;n de la Naci&#xf3;n<named-content content-type="fundref-id">10.13039/501100006667</named-content>
</contract-sponsor>
<counts>
<fig-count count="3"/>
<table-count count="2"/>
<equation-count count="0"/>
<ref-count count="57"/>
<page-count count="13"/>
<word-count count="7641"/>
</counts>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Highlights</title>
<list list-type="bullet">
<list-item>
<p>Whole-genome SNP analysis showed high divergence among <italic>Echinococcus sensu lato</italic>.</p>
</list-item>
<list-item>
<p>
<italic>Echinococcus</italic> chromosomes have differential distribution of single-nucleotide polymorphisms.</p>
</list-item>
<list-item>
<p>Sylvatic <italic>Echinococcus</italic> species are phylogenetically closer to each other than <italic>Echinococcus granulosus sensu lato</italic> species.</p>
</list-item>
<list-item>
<p>Phylogenetic analysis demonstrated that <italic>Echinococcus oligarthrus</italic> is one of the basal species.</p>
</list-item>
</list>
</sec>
<sec id="s2" sec-type="intro">
<title>Introduction</title>
<p>Helminth parasites are a highly diverse group that involves many parasites of biomedical, veterinary, and economic importance including roundworms (nematodes) and flatworms (Platyhelminthes: trematodes and cestodes). Taxonomic assignment of the parasitic helminth taxa is a particularly arduous task. Indeed, parasites are typically difficult to culture and analyze independently of their hosts and the parasite body fossil samples are scarce due to their small size, lack of hard parts, and their lifestyle within the host. In addition, molecular analyses do not often include all the species or include only partial sequences (<xref ref-type="bibr" rid="B13">De Baets et al., 2015</xref>). Among Platyhelminthes, the classification and nomenclature within the genus <italic>Echinococcus</italic> is a controversial topic (<xref ref-type="bibr" rid="B35">McManus, 2013</xref>; <xref ref-type="bibr" rid="B31">Lymbery et al., 2015</xref>). The strobilar stage of these parasites occurs in the small intestine of a definitive carnivore host; the metacestodes develop in the organs of an herbivorous intermediate host that is the prey of the final hosts. In the last decades, several subspecies of <italic>Echinococcus granulosus</italic> were proposed based mainly on the intermediate host specificity (<xref ref-type="bibr" rid="B55">Verster, 1969</xref>). However, most of the early proposed subspecific taxa were relegated to synonymy under the name <italic>E. granulosus</italic> due to their sympatric distributions and because they are indistinguishable at the morphological level (<xref ref-type="bibr" rid="B44">Rausch et al., 1967</xref>). Consequently, based on the production of a distinctive form of echinococcosis in humans, only four species had been retained: <italic>E. granulosus sensu lato</italic>, which causes cystic echinococcosis; <italic>Echinococcus oligarthrus</italic>, which causes unicystic echinococcosis; <italic>Echinococcus vogeli</italic>, which causes polycystic echinococcosis; and <italic>Echinococcus multilocularis</italic>, which causes alveolar echinococcosis. The pathogenicity degree of the echinococcosis infections depends on the characteristics of the metacestode development, which is different in each one of the four species mentioned before (<xref ref-type="bibr" rid="B30">Lymbery, 2017</xref>). Nowadays, genetic and genomic mitochondrial analysis allows the revision of the phylogeny of the <italic>Echinococcus</italic> genus determining the species rank for nine taxa: <italic>E. granulosus sensu stricto</italic>, <italic>Echinococcus canadensis</italic>, <italic>Echinococcus ortleppi</italic>, <italic>Echinococcus equinus</italic>, <italic>Echinococcus felidis</italic>, <italic>E. oligarthrus</italic>, <italic>E. vogeli</italic>, <italic>E. multilocularis</italic>, and <italic>Echinococcus shiquicus</italic> (<xref ref-type="bibr" rid="B56">Xiao et al., 2006</xref>; <xref ref-type="bibr" rid="B37">Nakao et al., 2007</xref>; <xref ref-type="bibr" rid="B20">H&#xfc;ttner et al., 2008</xref>; <xref ref-type="bibr" rid="B36">Nakao et al., 2013a</xref>; <xref ref-type="bibr" rid="B23">Kinkar et al., 2017</xref>). Nuclear DNA has also been used to reconstruct the phylogeny of taeniid parasites, which differs from the phylogeny obtained with mitochondrial data (<xref ref-type="bibr" rid="B47">Saarma et al., 2009</xref>; <xref ref-type="bibr" rid="B24">Knapp et al., 2011</xref>). Even though there are certain common features in the taxonomy regardless of the origin of the molecular data. <italic>E. felidis</italic> and <italic>E. granulosus s. s.</italic> are sister species, and <italic>E. ortleppi</italic> is closely related to the different genotypes of <italic>E. canadensis</italic>. However, the position of the neotropical species, <italic>E. vogeli</italic> and <italic>E. oligarthrus</italic> (basal or non-basal), and whether <italic>E. multilocularis</italic> and <italic>E. shiquicus</italic> are sister species remain unknown (<xref ref-type="bibr" rid="B30">Lymbery, 2017</xref>). Several authors agree that further analyses using more nuclear DNA sequences are required in order to completely resolve the relationships among putative species within the genus (<xref ref-type="bibr" rid="B47">Saarma et al., 2009</xref>; <xref ref-type="bibr" rid="B36">Nakao et al., 2013a</xref>; <xref ref-type="bibr" rid="B30">Lymbery, 2017</xref>). Particularly, neotropical species have been the least studied and only a few cases have been reported and published so far (<xref ref-type="bibr" rid="B10">D&#x2019;Alessandro and Rausch, 2008</xref>; <xref ref-type="bibr" rid="B51">Soares et al., 2013</xref>; <xref ref-type="bibr" rid="B1">Arrabal et al., 2017</xref>). Hence, a better sampling and understanding of the neotropical species will help to resolve the <italic>Echinococcus</italic> phylogeny. In 2013, with the publication of the first tapeworm genomes that included <italic>E. multilocularis</italic> and <italic>E. granulosus</italic> G1 species (<xref ref-type="bibr" rid="B54">Tsai et al., 2013</xref>; <xref ref-type="bibr" rid="B57">Zheng et al., 2013</xref>), the &#x201c;Tapeworm genome era&#x201d; began. In 2017, we sequenced and assembled the <italic>E. canadensis</italic> G7 genome and performed several comparative genomic analyses confirming the species status of the taxa (<xref ref-type="bibr" rid="B32">Maldonado et al., 2017</xref>). In this work, with the aim of obtaining a deeper description of the <italic>Echinococcus</italic> phylogeny, we sequenced the complete <italic>E. oligarthrus</italic> genome and performed whole-genome variant analysis among all the <italic>Echinococcus</italic> species that are currently available. The results presented here demonstrate the basal origin of the neotropical <italic>Echinococcus</italic> species and propose that the use of complete genome data is crucial for the unequivocal helminth phylogenetic studies.</p>
</sec>
<sec id="s3" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="s3_1">
<title>Sample Collection, DNA Extraction, and Next-Generation Sequencing</title>
<sec id="s3_1_1">
<title>Parasites Material</title>
<p>
<italic>E. oligarthrus</italic> cysts were collected from Iguaz&#xfa; National Park, in the North of Misiones province, Argentina. Cysts were obtained from the livers of naturally infected agoutis (<italic>Dasyprocta azarae</italic>). The animals involved in this study were not subjected to any experimental procedure. All the samples used in this study were collected post-mortem from road-killed animals. For genome sequencing purposes, protoscoleces were aseptically removed from the cysts and extensively washed in phosphate buffer saline and visualized under an optical microscope. The species and genotype were determined by sequencing a fragment of the mitochondrial cytochrome C oxidase subunit 1 (COX1) (<xref ref-type="bibr" rid="B1">Arrabal et al., 2017</xref>).</p>
</sec>
<sec id="s3_1_2">
<title>DNA Isolation, Library Construction, and DNA Sequencing</title>
<p>The isolation of high-quality genomic DNA was performed by the phenol/chloroform method as previously described (<xref ref-type="bibr" rid="B32">Maldonado et al., 2017</xref>). Briefly, the samples were quantified using a Qubit Fluorometer (Invitrogen), the quality was evaluated by the OD rate 260/280 and 260/230 using a NanoDrop (ThermoFisher Scientific). MiSeq Illumina libraries were prepared as follows. For each library preparation, 50 ng of DNA was subjected to a random tagmentation reaction, and DNA was simultaneously fragmented and linked to specific adapters using the Nextera<sup>&#xae;</sup> XT DNA Sample Preparation Kit, according to the manufacturer&#x2019;s instructions. Two libraries of 530-bp fragment size were obtained and subjected to 500 sequencing cycles (2 &#xd7; 250 bp) using the MiSeq v2 Reagent Kit. The quality of the Illumina reads was evaluated with FastQC v0.10.1, and the reads were trimmed and end-clipped to a Phred score of 33 using Trimmomatic (<xref ref-type="bibr" rid="B6">Bolger et al., 2014</xref>).</p>
</sec>
</sec>
<sec id="s3_2">
<title>
<italic>De Novo</italic> Assembly of <italic>E. oligarthrus</italic> NGS Reads</title>
<p>The genome of <italic>E. oligarthrus</italic> was assembled from a combination of two paired-end libraries sequenced in the Illumina MiSeq platform. The genome assembly involved several steps. First, a preliminary <italic>de novo</italic> assembly using SPAdes 3.6 (<xref ref-type="bibr" rid="B3">Bankevich et al., 2012</xref>; <xref ref-type="bibr" rid="B48">Safonova et al., 2014</xref>) was performed. The assembly sequences were screened against the NCBI nt database, using Nucleotide&#x2013;Nucleotide BLAST v2.6.0+ (available at ftp://ftp.ncbi.nlm.nih.gov/blast/db/FASTA/nt.gz) in megablast mode, with an e-value cutoff of 1e&#x2212;25 and a culling limit of 2 and using DIAMOND tblastx against SwissProt (<xref ref-type="bibr" rid="B8">Buchfink et al., 2015</xref>). Raw, paired-end Illumina reads were mapped against the assembly using Bowtie2 (<xref ref-type="bibr" rid="B28">Langmead and Salzberg, 2012</xref>). The output was converted to a BAM file using Samtools (<xref ref-type="bibr" rid="B29">Li and Durbin, 2009</xref>). Blobtools v1.1 (<xref ref-type="bibr" rid="B26">Laetsch and Blaxter, 2017</xref>) was used to create taxon-annotated GC-coverage plots for <italic>E. oligarthrus</italic> genome assembly and to identify the target sequences and target reads using as input the Nucleotide&#x2013;Nucleotide and DIAMOND BLAST (<xref ref-type="bibr" rid="B8">Buchfink et al., 2015</xref>) and the raw read mapping results. Sequences that did not match the Platyhelminthes taxon as a top BLAST hit at the phylum level were filtered out. After removing contaminants, sequences that did match as a top BLAST hit at the phylum level and whose base coverage was &gt;4&#xd7; were used to recover the target reads. Before the assembly, we used Jellyfish to create a k-mer histogram and the convergence was analyzed with genomescope in order to evaluate whether after cleaning with blobtools the remaining reads could be used to obtain an acceptable assembly. The target reads extracted from the bam files were used to re-assemble the <italic>E. oligarthrus</italic> genome using SPAdes 3.12 (<xref ref-type="bibr" rid="B3">Bankevich et al., 2012</xref>; <xref ref-type="bibr" rid="B48">Safonova et al., 2014</xref>). The scaffolds of <italic>E. oligarthus</italic> were obtained with Chromosomer (<xref ref-type="bibr" rid="B53">Tamazian et al., 2016</xref>) and using <italic>E. multilocularis</italic> as the reference genome [WormBase ParaSite, Version WBPS12 (WS267)]. Redundant unplaced and unlocalized sequences were discarded from the final assembly. The fragment sequence was considered unplaced if two or more alignments located on different reference sequences or unlocalized if two or more alignments located on the same reference sequences according to <xref ref-type="bibr" rid="B53">Tamazian et al. (2016)</xref>. The contigs that did not map to the reference genome were included in the final assembly. The standard quality metrics of the assembly such as N50, the total number of contigs and the total length of the assembly were evaluated using QUAST (<xref ref-type="bibr" rid="B16">Gurevich et al., 2013</xref>). Putative non-target contigs shorter than 500 bp in length were removed from the final assembly. The completeness of the gene space was validated using BUSCO2 (<xref ref-type="bibr" rid="B49">Sim&#xe3;o et al., 2015</xref>) and eukaryote CEGGs database. Furthermore, the core of cestodes genes [genes contained in all cestode species according to <xref ref-type="bibr" rid="B32">Maldonado et al. (2017)</xref>] was screened using BLAST. Coverage and depth coverage were calculated with custom scripts. Depth coverage refers to the number of times that the same region or position in the reference genome is represented by the assembled genome. Coverage refers to the percentage of the total length of the reference genome that is represented by the assembled genome.</p>
</sec>
<sec id="s3_3">
<title>Gene Prediction and Annotation</title>
<p>The gene annotation of <italic>E. oligarthrus</italic> was performed by transferring the gene annotations from the <italic>E. multilocularis</italic> genome [WormBase ParaSite, Version WBPS12 (WS267)] using a manual and own scripting approach. CDS and protein sequences of <italic>E. multilocularis</italic> and the scaffolds of <italic>E. oligarthrus</italic> were used for this purpose. First, the gene allocated regions were identified using BLAST with an e-value cutoff of 1e&#x2212;10 and one best target hit. The CDS fragments of <italic>E. oligarthrus</italic> were extracted using bedtools (<xref ref-type="bibr" rid="B43">Quinlan and Hall, 2010</xref>; <xref ref-type="bibr" rid="B42">Quinlan, 2014</xref>) and the transcripts were re-assembled using Chromosomer (<xref ref-type="bibr" rid="B53">Tamazian et al., 2016</xref>). GeneWise (<xref ref-type="bibr" rid="B5">Birney et al., 2004</xref>) was used to find the correct frameshift of the CDS and to obtain the final datasets of CDS and proteins. The coordinates of the annotations were assessed using Exonerate (<xref ref-type="bibr" rid="B50">Slater et al., 2005</xref>) and added to a final GFF file. The performance of gene annotation and basic statistics for <italic>E. oligarthrus</italic> gene models, including the average intron/exon lengths and the number of introns, were calculated using Eval (<xref ref-type="bibr" rid="B22">Keibler and Brent, 2003</xref>). The functional gene annotation was performed using InterProScan-5.7-48.0 (<xref ref-type="bibr" rid="B41">Quevillon et al., 2005</xref>) and InterPro2GO databases were used to assign Gene Ontology (GO) terms (<xref ref-type="bibr" rid="B2">Ashburner et al., 2000</xref>). Gene models were subjected to BLAST search (<xref ref-type="bibr" rid="B52">Stemmer et al., 2013</xref>) against UniprotDB and Blast2GO (<uri xlink:href="https://www.blast2go.com/">https://www.blast2go.com/</uri>) was used to define the final annotations and GO terms stats. The GO terms were analyzed using GO.db database implementing R, Bioconductor version: Release (3.8) with custom scripts. Proteins studied in this work were searched using BLAST (<xref ref-type="bibr" rid="B52">Stemmer et al., 2013</xref>) against UniProtKB/Swiss-Prot databases. Protein domains were screened against PFAM, and Prosite databases using PFAM_scan (<xref ref-type="bibr" rid="B15">Finn, 2006</xref>) or HMMscan 3.0. Global pairwise alignments were performed using Needle software (<xref ref-type="bibr" rid="B45">Rice et al., 2000</xref>) in order to identify the orthologous genes related to host&#x2013;parasite interactions between <italic>E. oligarthrus</italic> and <italic>E. multilocularis</italic> (<xref ref-type="bibr" rid="B7">Brehm and Koziol, 2017</xref>). Identity and coverage stats were calculated and only hits with more than 50% of coverage were selected.</p>
</sec>
<sec id="s3_4">
<title>Phylogeny With Nuclear Molecular Markers</title>
<p>Molecular markers of the <italic>Echinococcus</italic> nuclear genome were downloaded from GenBank (<xref ref-type="bibr" rid="B23">Kinkar et al., 2017</xref>). For each locus, homologous regions were extracted from complete <italic>Echinococcus</italic> genomes [WormBase ParaSite, Version WBPS12 (WS267)] and the <italic>E. oligarthrus</italic> genome obtained in this work using custom scripts. Homologous genomic sequences from <italic>Taenia solium</italic> were employed as outgroup. The DNA sequences were aligned with ClustalX (v2.0.12) and multiple alignments were edited with BioEdit (v7.1.3). Phylogenetic analyses of concatenated nuclear markers were performed using the maximum likelihood method and Tamura-Nei model implemented by MEGAX software (<xref ref-type="bibr" rid="B25">Kumar et al., 2018</xref>). The bootstrap consensus trees inferred from 1000 replicates were retained. The percentage of replicate trees in which the associated taxa clustered together in more than 50% in the bootstrap test are shown for each node. The concatenated data resulted in 1552 bp length for seven taxa. A second phylogeny was calculated using MrBayes 3.1.2 (<xref ref-type="bibr" rid="B46">Ronquist et al., 2012</xref>). The evolutionary model was set on generalized time reversible substitution model with gamma-distributed rate variation across the sites and a proportion of invariable sites (GTR +G + I model) with at least 200 samples from the posterior probability distribution, and diagnostics calculated every 1000 generations. We use an exponential prior on the branch length with mean = 0.1 substitutions/site.</p>
</sec>
<sec id="s3_5">
<title>Phylogeny Using Whole-Genome SNPs and Artificial Genome Sequence Construction</title>
<p>Variant calling was performed as described in the &#x201c;Whole-Genome SNP analysis&#x201d; section. Genome-wide SNPs were used to perform phylogeny analysis as follows: first, heterozygous SNPs were removed and only the homozygous SNPs with a depth coverage &gt;20&#xd7; and strictly covered in all the <italic>Echinococcus</italic> species were selected to correct for complete lineage sorting. Afterwards, the homozygous SNP loci were concatenated and the resulting sequence alignment was used to create a phylogenetic tree by implementing the Maximum Likelihood and Bayesian method as above. PartitionFinder 2 (<xref ref-type="bibr" rid="B27">Lanfear et al., 2016</xref>) was used to select the best-fit partitioning schemes and models of evolution for the phylogenetic analysis. The selected model was implemented to perform phylogeny analysis.</p>
</sec>
<sec id="s3_6">
<title>Whole-Genome SNP Analysis</title>
<p>Genome sequences of <italic>Echinococcus</italic> species were ordered in chromosomes according to the last version of the <italic>E. multilocularis</italic> genome [WormBase ParaSite, Version WBPS12 (WS267)] using Chromosomer (<xref ref-type="bibr" rid="B53">Tamazian et al., 2016</xref>) and were used as mapping templates for the variant calling analyses. Variant calling was performed as follows: first, all of the raw reads from <italic>E. canadensis</italic> G7, <italic>E. granulosus</italic> G1, <italic>E. multilocularis</italic>, and <italic>E. oligarthrus</italic> libraries were processed and filtered by quality. Then, the reads were first mapped against their own reference genomes and then against the genomes of the other three <italic>Echinococcus</italic> species using bowtie2 (<xref ref-type="bibr" rid="B28">Langmead and Salzberg, 2012</xref>). Mapping statistics were calculated with Bamtools (<xref ref-type="bibr" rid="B4">Barnett et al., 2011</xref>), and duplications were marked and discarded using picard-tools v-2.18 (<uri xlink:href="http://broadinstitute.github.io/picard/">http://broadinstitute.github.io/picard/</uri>). The variant calling was performed using bcftools (<xref ref-type="bibr" rid="B39">Narasimhan et al., 2016</xref>; <xref ref-type="bibr" rid="B12">Danecek and McCarthy, 2017</xref>) and GATK (<xref ref-type="bibr" rid="B34">McKenna et al., 2010</xref>) using the following parameters: variation frequency was set &gt;40% with a depth coverage of at least 70% of the total mean coverage; the base quality of both reference site and variation site was set to &gt;30. Insertion and deletions (indels) were filtered out with VCFtools (<xref ref-type="bibr" rid="B11">Danecek et al., 2011</xref>), and SNPs with less than 10 bp far from indels were removed to avoid false-positive SNP. In order to annotate the heterozygous and homozygous polymorphic sites, the reads were first mapped against their own reference genomes and then against the other target genomes. Heterozygous sites were retained only if both forward and reverse reads mapped against the reference and alternative allele at a given nucleotide position whose depth coverage was at least 70% of the total mean coverage supporting that position. Homozygous polymorphic sites were annotated if the forward and reverse reads mapped onto the alternative allele with at least 70% of the total mean coverage supporting that position and if there were no reads supporting the reference allele. Homozygous and heterozygous variant sites were registered for all of the species. The Transition/transversion ratios were calculated using VCFtools (<xref ref-type="bibr" rid="B11">Danecek et al., 2011</xref>), and the annotation and classification of SNPs based on the effect of annotated genes were carried out with SnpEff v4.0 (<xref ref-type="bibr" rid="B9">Cingolani et al., 2012</xref>). Graphics were built using R software. (<uri xlink:href="https://www.r-project.org/">https://www.r-project.org/</uri>).</p>
</sec>
</sec>
<sec id="s4" sec-type="results">
<title>Results</title>
<sec id="s4_1">
<title>The <italic>E. oligarthrus</italic> Genome</title>
<p>The <italic>E. oligarthrus</italic> genome was assembled from two paired-end libraries sequenced in Illumina MiSeq. High-quality genomic DNA was purified from two cysts isolated from its natural host, <italic>D. azarae</italic>, which was naturally infected and found in Iguaz&#xfa; National Park, Misiones province, Argentina. Microscopic and macroscopic findings indicated that it was a neotropical <italic>Echinococcus</italic> species. The species and the genotype were confirmed by polymerase chain reaction (PCR) amplification of cytochrome oxidase 1 (cox1) followed by direct sequencing. The <italic>E. oligarthrus</italic> group 1 cox1 sequence was determined (<xref ref-type="bibr" rid="B1">Arrabal et al., 2017</xref>). The <italic>E. oligarthrus</italic> genome assembly was performed using a <italic>de novo</italic> assembly strategy, and the best assembly was chosen based on its quality metrics. Due to the type of the metacestode development that consists of close interaction with the host&#x2019;s tissue and in spite of having done a careful extraction of the parasite material, we sequenced a high proportion of host&#x2019;s DNA. Indeed, preliminary analysis of raw reads revealed that the sequencing yield of parasite DNA was &#x223c;22%, whereas the remaining &#x223c;78% of the sequences was from presumably <italic>D. azarae</italic>, whose genome has not been sequenced yet. Therefore, we performed a screening of potential contaminants before obtaining the final assembly. Since the genome of <italic>D. azarae</italic> is unknown and non-related sequences are available, we made a thorough identification of sequences truly derived from the target genome using Blob tools software (<xref ref-type="bibr" rid="B26">Laetsch and Blaxter, 2017</xref>). This software was specially designed for DNA contamination assessment and is particularly useful for organisms of parasitic origin. Here, we performed a taxonomic selection of the target sequences using the Platyhelminthes taxon (Taxonomy ID: 6157) and created a taxon-annotated GC-coverage plot that allowed us to identify Platyhelminthes contigs and discard all the non-target sequences (<xref ref-type="supplementary-material" rid="SM2">
<bold>Supplementary Figure 1</bold>
</xref>). After removing the contaminants, sequences that did match as a top BLAST hit at the phylum level and whose base coverage was &gt;4&#xd7; and %GC &#x223c; 41% (typical GC content of <italic>Echinococcus</italic> genus) were used to recover the target reads and re-assemble the <italic>E. oligarthrus</italic> genome (see Materials and methods for more details). The final assembly contained 74513 contigs that were further used to obtain the genome scaffolds. Redundant unplaced and unlocalized sequences (19.7 Mb) were removed from the contigs to obtain the final scaffolds assembly (see Materials and methods for more details). The contigs that did not map to the reference genome (did not allocate in scaffolds) were included in the final assembly. The final <italic>E. oligarthrus</italic> genome assembly was composed of 3764 sequences comprising a total of 86.22 Mb with an average GC content of 41% and showed an N50 &#x223c; 10 Mb and &#x223c;20&#xd7; in depth coverage. The assembly metrics for contigs and scaffolds are shown in <xref ref-type="table" rid="T1">
<bold>Table 1</bold>
</xref> and <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Table S1</bold>
</xref>. The nuclear genome of <italic>E. oligarthrus</italic> showed 76.3% coverage and &#x223c;90% identity with the genome of <italic>E. multilocularis</italic> whose genome size is &#x223c;115 Mb [WormBase ParaSite, Version WBPS12 (WS267)]. The percentage average of nucleotide identity among <italic>Echinococcus</italic> chromosomes ranged from 88.7% to 92.7% (<xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Table S2</bold>
</xref>). The assembly also included the <italic>E. oligarthrus</italic> mitochondrial genome composed of one scaffold that was obtained from the contigs Eoli_00665 and Eoli_02629. The scaffold length was 13,893 bp with 98% coverage and 96% nucleotide identity to the <italic>E. oligarthrus</italic> mitochondrial reference genome (GenBank accession number AB208545). As expected, the nucleotide identity of the mitochondrial genomes was lower in comparison with other <italic>Echinococcus</italic> species (<xref ref-type="supplementary-material" rid="SM3">
<bold>Supplementary Figure 2</bold>
</xref>). To assess the completeness of the genome assembly, we evaluated the gene space using &#x201c;Benchmarking Universal Single-Copy Orthologs&#x201d; (BUSCO) (<xref ref-type="bibr" rid="B49">Sim&#xe3;o et al., 2015</xref>), which measures the genome completeness based on evolutionarily informed expectations of gene content. In this analysis, we identified 77.6% (235 of the 303 core genes) that are expected to be present in all metazoans, including 125 complete and duplicated, 110 fragmented, and 68 missing orthologs. The fragmented nature of the assembly may have prevented many genes from meeting the stringent matching criteria implemented by BUSCO. Indeed, the BLAST results suggest that most of the core genes are identifiable in the genome, even though many genes are present as fragments within the assembly. Also, due to the fragmentation level of the genome assembly, a hybrid and <italic>ab initio</italic> gene prediction is likely impracticable. Since all the available gene annotation transfer tools tried here showed low efficiency, the gene annotation was performed by manual gene annotation transfer as described in the Materials and methods section. First, we used BLAST to search for the best suitable <italic>Echinococcus</italic> species to be used as a template. In this regard, we found that <italic>E. multilocularis</italic> was the most suitable species because of the integrity of the assembly and the higher identity with the genome of <italic>E. oligarthrus</italic> (<xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Table S2</bold>
</xref>). The final set of <italic>E. oligarthrus</italic> genes comprised 8753 genes coding for proteins (4494 genes with coverage &gt; 50% and 4259 genes with coverage &lt; 50%) (<xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Table S3</bold>
</xref>). The whole set of genes represented the 85.0% of the total genes expected to be found in species of the genus <italic>Echinococcus</italic>. The predicted proteins were also screened against the conserved core of genes that are present in all the cestodes species according to <xref ref-type="bibr" rid="B32">Maldonado et al. (2017)</xref> using BLAST. In this regard, 4872 out of 5203 genes were found for <italic>E. oligarthrus</italic> using e values of &lt;1e&#x2212;12, which comprised the 93.6% of the total conserved core of genes in cestodes, indicating that the genome contains useful molecular data (<xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Table S4</bold>
</xref>).</p>
<table-wrap id="T1" position="float">
<label>Table 1</label>
<caption>
<p>Genome-wide statistics for the <italic>E. oligarthrus</italic> assembly and gene findings.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top">Genome statistics</th>
<th valign="top">Scaffolds</th>
<th valign="top">Contigs</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top">Size of genome (Mb) (*)</td>
<td valign="top">86.2</td>
<td valign="top">105.9</td>
</tr>
<tr>
<td valign="top">GC content (%)</td>
<td valign="top">41</td>
<td valign="top">41</td>
</tr>
<tr>
<td valign="top">Number of sequences</td>
<td valign="top">3764</td>
<td valign="top">74,513</td>
</tr>
<tr>
<td valign="top">N50 (Mb)</td>
<td valign="top">10.2</td>
<td valign="top">12.83 kb</td>
</tr>
<tr>
<td valign="top">Gaps (Ns/100 kb)</td>
<td valign="top">1366</td>
<td valign="top">0</td>
</tr>
<tr>
<td valign="top">Largest contig (Mb)</td>
<td valign="top">16.0</td>
<td valign="top">63.7 kb</td>
</tr>
<tr>
<td valign="top">Deep coverage</td>
<td valign="top">20&#xd7;</td>
<td valign="top">20&#xd7;</td>
</tr>
<tr>
<td valign="top">Number of predicted genes</td>
<td valign="top">8753</td>
<td valign="top"/>
</tr>
<tr>
<td valign="top">Gene density per Mb </td>
<td valign="top">101.5</td>
<td valign="top"/>
</tr>
<tr>
<td valign="top">Length of proteome (amino acids)</td>
<td valign="top">2,009,586</td>
<td valign="top"/>
</tr>
<tr>
<td valign="top">Maximum protein length (amino acids)</td>
<td valign="top">2254</td>
<td valign="top"/>
</tr>
<tr>
<td valign="top">Average protein length (amino acids)</td>
<td valign="top">229</td>
<td valign="top"/>
</tr>
<tr>
<td valign="top">Average exon length (bp)</td>
<td valign="top">203</td>
<td valign="top"/>
</tr>
<tr>
<td valign="top">Median exon length (bp)</td>
<td valign="top">158</td>
<td valign="top"/>
</tr>
<tr>
<td valign="top">Average exons per transcript</td>
<td valign="top">3</td>
<td valign="top"/>
</tr>
<tr>
<td valign="top">Median exons per transcript</td>
<td valign="top">2</td>
<td valign="top"/>
</tr>
<tr>
<td valign="top">Total length of contained introns (kb)</td>
<td valign="top">8622</td>
<td valign="top"/>
</tr>
<tr>
<td valign="top">Average intron length (bp)</td>
<td valign="top">709</td>
<td valign="top"/>
</tr>
<tr>
<td valign="top">Median intron length (bp)</td>
<td valign="top">262</td>
<td valign="top"/>
</tr>
<tr>
<td valign="top">BUSCO</td>
<td valign="top">235/303</td>
<td valign="top">162/303</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>(*) Redundant sequences (19.7 Mb) were removed from the contigs to obtain the final scaffolds assembly.</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="s4_2">
<title>
<italic>E. oligarthrus</italic> Genes</title>
<p>GO terms were assigned to the 40% of <italic>E. oligarthrus</italic> proteins. In relation to the Molecular Function GO terms frequency, the two main categories found were &#x201c;binding (GO:0005488)&#x201d; and &#x201c;catalytic activity (GO:0003824)&#x201d;, which is in accordance with the GO terms frequency observed in other cestode genomes (<xref ref-type="bibr" rid="B19">Hahn et al., 2014</xref>) and <italic>Echinococcus</italic> species (<xref ref-type="bibr" rid="B32">Maldonado et al., 2017</xref>). For the Biological Process GO term, the highest frequencies observed were the categories &#x201c;cellular process (GO:0009987)&#x201d; and &#x201c;metabolic process (GO:0008152)&#x201d;, also in accordance with the GO terms frequency observed in the related organism (<xref ref-type="supplementary-material" rid="SM4">
<bold>Supplementary Figure 3</bold>
</xref>).</p>
<p>From the total gene-set, we searched for genes that have already been described as having a role in host&#x2013;parasite interactions in <italic>Echinococcus</italic> (<xref ref-type="bibr" rid="B7">Brehm and Koziol, 2017</xref>). In this regard, we found 56 genes involved in host&#x2013;parasite interactions whose coverages and identities to <italic>E. multilocularis</italic> orthologous genes were &gt;50% (<xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Table S5</bold>
</xref>). Particularly, two genes encoding for the epidermal growth factor (EGF) tyrosine kinase receptors (Eoli_000075800 and Eoli_000617300) with a high percentage of identity (59.8% and 52.6%) and coverage (65.3% and 51.4%) were identified. Also, we found a gene encoding for fibroblast growth factor (FGF) receptor tyrosine kinase (Eoli_000833200) whose percentage of identity and coverage values were 76% and 100%, respectively. Regarding nuclear receptor hormones, we found nine orthologous genes with high identity and coverage (83.8% and 51.8% on average, respectively) including the cestode-specific nuclear hormone receptor Eoli_000937000. Non-kinase receptors were also found, comprising four genes, including Frizzled G protein-coupled receptors (GPCR) (Eoli_000682100) with high identity and coverage (88.6% and 87.6%, respectively). Finally, we identified the amino acid transporters (DAACS family), lipid binding proteins (FABPs), and antigens (AgB and Eg95), specific and conserved proteins in cestodes/<italic>Echinococcus</italic>.</p>
</sec>
<sec id="s4_3">
<title>Comparative Whole Genome Based on SNPs Analysis</title>
<p>The nucleotide variation in the genomes of different <italic>Echinococcus</italic> species was assessed through variant calling analysis. Genetic variants were identified among the genomes of <italic>E. oligarthrus</italic>, <italic>E. multilocularis</italic>, <italic>E. granulosus</italic> G1, and <italic>E. canadensis</italic> G7. Here, we focused on the study of single-nucleotide polymorphisms (SNPs). In order to perform this analysis, NGS raw reads were mapped against a reference genome composed of chromosomes and contigs. For all the analyses, the reads were first mapped against their own reference genomes and then against the genome of the corresponding analyzed species. Homozygous and heterozygous variant sites were identified and were marked in both the reference and the alternative allele (see Materials and Methods for more details). First, we evaluated the intraspecific variation of the <italic>Echinococcus</italic> genomes. As described in <xref ref-type="bibr" rid="B32">Maldonado et al. (2017)</xref>. <italic>E. granulosus</italic> G1 genome exhibited the highest number of intraspecific variant sites (74,796 SNPs, 0.65 SNPs/kb), followed by <italic>E. oligarthrus</italic> (23,301 SNP, 0.23 SNP/kb), <italic>E. canadensis</italic> G7 (10,791 SNPs, 0.095 SNPs/kb), and <italic>E. multilocularis</italic> (1,287 SNPs, 0.011 SNPs/kb). With regard to the genetic diversity observed among the <italic>Echinococcus</italic> species, we observed that the SNP distribution was similar to the distribution described in our previous research (<xref ref-type="bibr" rid="B32">Maldonado et al., 2017</xref>). <italic>E. canadensis</italic> G7 and <italic>E. granulosus</italic> G1 showed a higher number of SNPs (842,322, Ts/Tv = 2.97) between each other than between <italic>E. canadensis</italic> G7 and <italic>E. multilocularis</italic> (314,176, Ts/Tv = 2.97), and in comparison to <italic>E. granulosus</italic> G1 and <italic>E. multilocularis</italic> (272,138, Ts/Tv = 2.94). Furthermore, the pair <italic>E. canadensis</italic> G7 and <italic>E. granulosus</italic> G1 also showed almost 10 orders of magnitude more SNPs than between <italic>E. oligarthrus</italic> and <italic>E. multilocularis</italic> (38,911, Ts/Tv = 3.25). The number of SNPs between <italic>E. oligarthrus</italic> and <italic>E. granulosus</italic> (126,472 Ts/Tv = 2.99) was similar to the number of SNPs between <italic>E. oligarthrus</italic> and <italic>E. canadensis</italic> G7 (171,135, Ts/Tv = 3.00). We also identified homozygous and heterozygous variant sites for the four <italic>Echinococcus</italic> species, for both the reference and the alternative allele in each case. The number of homozygous SNPs was 313,992 for <italic>E. canadensis</italic> G7 and <italic>E. multilocularis</italic>, 266,180 for <italic>E. granulosus</italic> G1 and <italic>E. multilocularis</italic>, 38,762 for <italic>E. oligarthrus</italic> and <italic>E. multilocularis</italic>, 830,768 for <italic>E. canadensis</italic> G7 and <italic>E. granulosus</italic> G1, 125,147 for <italic>E. oligarthrus</italic> and <italic>E. granulosus</italic> G1, and 170,049 for <italic>E. oligarthrus</italic> and <italic>E. canadensis</italic> G7 (<xref ref-type="table" rid="T2">
<bold>Table 2</bold>
</xref>). Moreover, the SNP density was assessed in terms of the number of SNPs per 1-Mb length of the <italic>Echinococcus</italic> chromosomes. For all the <italic>Echinococcus</italic> genomes, the highest SNP density (SNPs/Mb of the chromosome) was found for chromosome 1. The chromosomes 2, 3, and 4 showed a slightly lower SNP density than chromosome 1 and chromosomes 5, 6, 7, 8, and 9 showed the lowest SNP density (<xref ref-type="fig" rid="f1">
<bold>Figure 1</bold>
</xref>). However, the intraspecific SNP density distribution for <italic>E. granulosus</italic> G1 was higher in chromosomes 1, 5, and 9. We also evaluated the SNP distribution of <italic>E. oligarthrus</italic> in the coding and non-coding genomic regions (intergenic, exons, and introns) of <italic>E. multilocularis</italic>. In this regard, the 38,911 SNPs distributed with a higher rate in exons and introns rather than in the intergenic regions. However, those distributed in the coding regions exhibited a higher rate of synonymous changes (67.3%) rather than missense changes (32.6%) (<xref ref-type="supplementary-material" rid="SM5">
<bold>Supplementary Figure 4</bold>
</xref>).</p>
<table-wrap id="T2" position="float">
<label>Table 2</label>
<caption>
<p>Number and type of SNPs among <italic>Echinococcus</italic> species.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="top">Species</th>
<th valign="top">Sample</th>
<th valign="top">Alt : Het</th>
<th valign="top">Alt : Hom</th>
<th valign="top">Total number of SNPs</th>
<th valign="top">Ts/Tv</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top">
<italic>E. multilocularis</italic>
</td>
<td valign="top">JAVA05</td>
<td valign="top">1152</td>
<td valign="top">135</td>
<td valign="top">1287</td>
<td valign="top">1.98</td>
</tr>
<tr>
<td valign="top">
<italic>E. canadensis</italic> G7</td>
<td valign="top">PH14</td>
<td valign="top">9937</td>
<td valign="top">854</td>
<td valign="top">10,791</td>
<td valign="top">3.25</td>
</tr>
<tr>
<td valign="top">
<italic>E. granulosus</italic> G1</td>
<td valign="top">GH09/3</td>
<td valign="top">69,672</td>
<td valign="top">5124</td>
<td valign="top">74,796</td>
<td valign="top">2.32</td>
</tr>
<tr>
<td valign="top">
<italic>E. oligarthrus</italic>
</td>
<td valign="top">ADA3-ADA5</td>
<td valign="top">23,074</td>
<td valign="top">227</td>
<td valign="top">23,301</td>
<td valign="top">2.97</td>
</tr>
<tr>
<td valign="top">
<italic>E. canadensis</italic> G7 vs <italic>E. multilocularis</italic>
<sup>a</sup>
</td>
<td valign="top">PH14&#x2013;JAVA05</td>
<td valign="top">168</td>
<td valign="top">313,992</td>
<td valign="top">314,176</td>
<td valign="top">2.97</td>
</tr>
<tr>
<td valign="top">
<italic>E. granulosus</italic> G1 vs <italic>E. multilocularis</italic>
<sup>b</sup>
</td>
<td valign="top">GH09/3&#x2013;JAVA05</td>
<td valign="top">5940</td>
<td valign="top">266,180</td>
<td valign="top">272,138</td>
<td valign="top">2.94</td>
</tr>
<tr>
<td valign="top">
<italic>E. oligarthrus</italic> vs <italic>E. multilocularis</italic>
<sup>c</sup>
</td>
<td valign="top">ADA&#x2013;JAVA05</td>
<td valign="top">147</td>
<td valign="top">38,762</td>
<td valign="top">38,911</td>
<td valign="top">3.25</td>
</tr>
<tr>
<td valign="top">
<italic>E. canadensis</italic> vs <italic>E. granulosus </italic>G1<sup>d</sup>
</td>
<td valign="top">PH14&#x2013;H95/5</td>
<td valign="top">1359</td>
<td valign="top">830,768</td>
<td valign="top">842,322</td>
<td valign="top">2.97</td>
</tr>
<tr>
<td valign="top">
<italic>E. oligarthrus</italic> vs <italic>E. granulosus</italic> G1<sup>e</sup>
</td>
<td valign="top">ADA&#x2013;H95/5</td>
<td valign="top">667</td>
<td valign="top">125,147</td>
<td valign="top">126,472</td>
<td valign="top">2.99</td>
</tr>
<tr>
<td valign="top">
<italic>E. oligarthrus</italic> vs <italic>E. canadensis</italic> G7<sup>f</sup>
</td>
<td valign="top">ADA3&#x2013;ADA5&#x2013;PH14</td>
<td valign="top">1044</td>
<td valign="top">170,049</td>
<td valign="top">171,135</td>
<td valign="top">3.00</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>
<sup>a</sup>Ref/Alt : Hom/Hom: 6; Ref/Alt : Hom/Het:0; Ref/Alt : Het/Het:1; Ref/Alt : Het/Hom:9. <sup>b</sup>Ref/Alt : Hom/Hom: 7; Ref/Alt : Hom/Het:0; Ref/Alt : Het/Het:0; Ref/Alt : Het/Hom:11. <sup>c</sup>Ref/Alt: Hom/Hom: 0; Ref/Alt : Hom/Het:0; Ref/Alt : Het/Het:0; Ref/Alt : Het/Hom:2. <sup>d</sup>Ref/Alt : Hom/Hom: 944; Ref/Alt : Hom/Het:0; Ref/Alt : Het/Het:13; Ref/Alt : Het/Hom:9238. <sup>e</sup>Ref/Alt : Hom/Hom: 55; Ref/Alt : Hom/Het:0; Ref/Alt : Het/Het:0; Ref/Alt : Het/Hom:603. <sup>f</sup>Ref/Alt : Hom/Hom: 7; Ref/Alt : Hom/Het:0; Ref/Alt : Het/Het:7; Ref/Alt : Het/Hom:35.</p>
</table-wrap-foot>
</table-wrap>
<fig id="f1" position="float">
<label>Figure 1</label>
<caption>
<p>Single-nucleotide polymorphisms in <italic>Echinococcus</italic> chromosomes. Density of single-nucleotide polymorphisms (SNPs/Mb) by chromosome 1&#x2013;9 among <italic>E. oligarthrus</italic>, <italic>E. multilocularis</italic>, <italic>E. canadensis</italic> G7, and <italic>E. granulosus</italic> G1 species.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fgene-10-00708-g001.tif"/>
</fig>
</sec>
<sec id="s4_4">
<title>
<italic>Echinococcus</italic> Phylogeny Reconstruction by Whole-Genome SNP Analysis</title>
<p>In order to evaluate the contribution of the SNPs to the genetic diversity among the different <italic>Echinococcus</italic> species, we performed phylogenetic analyses by implementing three different approaches: the first used the whole genome variant sites for the four <italic>Echinococcus</italic> species (only the SNPs), which consisted of analyzing 244,246 sites in each genome arising a total of 2.08 Mb; the second approach used genome regions for the four <italic>Echinococcus</italic> species whose depth coverage was &gt;20&#xd7; and that strictly contained variant sites supported by more than 20 reads, which involved the analysis of 40,179,279 sites in each genome arising in a total of &#x223c;162 Mb. For the third approach, we used only coding regions for the four <italic>Echinococcus</italic> species whose depth coverage was &gt;20&#xd7; and that strictly contained variant sites supported by more than 20 reads. This involved the analysis of 42,200 sites in each genome arising in a total of 168.8 kb. In all the cases, only the homozygous SNPs were used to perform phylogenetic analyses. The sites and sequences retained here were concatenated and the resulting alignment was used to create the phylogenetic trees by implementing the maximum likelihood and Bayesian methods (see Materials and methods for more details).</p>
<p>The construction of the phylogenetic tree implementing the Bayesian method and using whole-genome SNPs showed a topology that demonstrated a higher genetic distance between <italic>E. canadensis</italic> G7 and <italic>E. granulosus</italic> G1 in comparison with the common node from which <italic>E. multilocularis</italic> and <italic>E. oligarthrus</italic> diverge equidistantly at a very short distance (<xref ref-type="fig" rid="f2">
<bold>Figure 2A</bold>
</xref>). <italic>E. multilocularis</italic> and <italic>E. oligarthrus</italic> exhibited a low genetic diversity between each other. PartitionFinder 2 (<xref ref-type="bibr" rid="B27">Lanfear et al., 2016</xref>) was used to select the best-fit partitioning schemes and models of evolution for the phylogenetic analysis. Transversional substitution model (TVMef) was the best-fitted model. Phylogenetic analysis using TVMef was consistent with the previous result (<xref ref-type="fig" rid="f2">
<bold>Figure 2B</bold>
</xref>). The topologies of the trees were also consistent with the topology observed using the Maximum Likelihood method (<xref ref-type="supplementary-material" rid="SM6">
<bold>Supplementary Figure 5A</bold>
</xref>). Phylogenetic analysis using genomic regions with depth coverage &gt;20&#xd7; sites and coding regions with depth coverage &gt;20&#xd7; also showed the same topology, except for the branches that exhibited different lengths (<xref ref-type="supplementary-material" rid="SM6">
<bold>Supplementary Figures 5B, C</bold>
</xref>). The total length of the coding sequences used here was &#x223c;42.2 kb and the genes sampled and used for this purpose are listed in <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Table S6</bold>
</xref>.</p>
<fig id="f2" position="float">
<label>Figure 2</label>
<caption>
<p>
<italic>Echinococcus</italic> phylogenetic tree based on genome-wide nuclear single nucleotide polymorphisms. Phylogenetic tree was constructed using BEAST software using <bold>(A)</bold> GTR + C + I model from four full genome SNPs sequences. <bold>(B)</bold> Transversional substitution model (TVMef) from four full genome SNPs sequences.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fgene-10-00708-g002.tif"/>
</fig>
<p>In addition, we registered how many polymorphic loci are shared among the <italic>Echinococcus</italic> species containing the same polymorphism. In previous research, we determined that the number of shared loci was higher when we used <italic>E. multilocularis</italic> as the reference genome, demonstrating to be the basal species and discarding <italic>E.granulosus</italic> G1 and <italic>E. canadensis</italic> G7 as possible candidates (<xref ref-type="bibr" rid="B32">Maldonado et al., 2017</xref>). In order to evaluate whether <italic>E. oligarthrus</italic> occupies a basal position in the genus, we incorporated <italic>E. oligarthrus</italic> and analyzed the number of shared loci using both <italic>E. multilocularis</italic> and <italic>E. oligarthrus</italic> as reference genomes and compared the results to each other. Due to the different genome quality assemblies and coverage, we normalized the number of SNPs in shared loci using the effective length of the sampled regions under the assumption that all the genome regions are equally subjected to mutation. In this regard, we found that 12,282 loci shared the same nucleotide change among <italic>E. canadensis</italic> G7, <italic>E. granulosus</italic> G1, and <italic>E. oligarthrus</italic> with respect to <italic>E. multilocularis</italic> (<italic>E. multilocularis</italic> used as reference). Moreover, the number of shared loci between <italic>E. oligarthrus</italic> and <italic>E. granulosus</italic> G1, between <italic>E. oligarthrus</italic> and <italic>E. canadensis</italic> G7, and between <italic>E. canadensis</italic> G7 and <italic>E. granulosus</italic> G1 was 4115, 8919, and 92,326, respectively. Most of the loci were unique for <italic>E. granulosus</italic> G1 (226,350) and <italic>E. canadensis</italic> G7 (335,728) rather than for <italic>E. oligarthrus</italic> (40,386). On the other hand, the number of shared loci containing the same nucleotide change among <italic>E. canadensis</italic> G7, <italic>E. granulosus</italic> G1, and <italic>E. multilocularis</italic> was 12,277 with respect to <italic>E. oligarthrus </italic>(<italic>E. oligarthrus</italic> used as reference). The number of shared loci between <italic>E. multilocularis</italic> and <italic>E. granulosus</italic> G1, between <italic>E. multilocularis</italic> and <italic>E. canadensis</italic> G7, and between <italic>E. canadensis</italic> G7 and <italic>E. granulosus</italic> G1 was 8675, 8875, and 54,792 respectively. Furthermore, and similar to the above results, the highest numbers of unique loci were for <italic>E. granulosus</italic> G1 (133,417) and <italic>E. canadensis</italic> G7 (242,815), rather than for <italic>E. multilocularis</italic> (60,455) (<xref ref-type="fig" rid="f3">
<bold>Figure 3</bold>
</xref>).</p>
<fig id="f3" position="float">
<label>Figure 3</label>
<caption>
<p>Venn diagram of shared loci between the different <italic>Echinococcus</italic> species using <italic>E. multilocularis</italic>
<bold>(A)</bold> and <italic>E. oligarthrus</italic>
<bold>(B)</bold> as reference. The numbers under each species name indicate the number of SNPs in the species against the corresponding reference genome. The numbers in the overlap region indicate the number of SNPs with the same polymorphism at the same locus between the species analyzed.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fgene-10-00708-g003.tif"/>
</fig>
<p>In order to gain accuracy and supporting evidence for our previous results, we reconstructed the <italic>Echinococcus</italic> phylogeny with nuclear molecular markers previously described (<xref ref-type="bibr" rid="B23">Kinkar et al., 2017</xref>). Nuclear molecular marker sequences of all the available <italic>Echinococcus</italic> species were concatenated and then were aligned and analyzed using both the Bayesian and maximum likelihood methods. This analysis involved the study of 1552 nucleotides that allowed to root the trees and reinforced the basal position of <italic>E. oligarthrus</italic> (<xref ref-type="supplementary-material" rid="SM6">
<bold>Supplementary Figure 5D</bold>
</xref>). The phylogenetic topology obtained was consistent with previous phylogenetic analyses of (<xref ref-type="bibr" rid="B38">Nakao et al., 2013b</xref>) that have placed <italic>E. oligarthrus</italic> (and <italic>E. vogeli</italic>) in a basal position for the genus <italic>Echinococcus</italic>. Similar phylogenetic topologies were obtained with the two methods employed here (<xref ref-type="supplementary-material" rid="SM6">
<bold>Supplementary Figure 5E</bold>
</xref>).</p>
</sec>
</sec>
<sec id="s5" sec-type="discussion">
<title>Discussion</title>
<p>In this work, we sequenced the first sylvatic species of <italic>Echinococcus</italic>, <italic>E. oligarthrus</italic>, isolated from its natural host and performed comparative genomics between both domestic and sylvatic species. One of the limitations of obtaining complete genomes of wildlife parasites from natural infections resides on the difficulty of obtaining DNA samples free of the host material. In particular, for parasites whose development occurs in intimate contact with the host tissue, such is the case of <italic>E. oligarthrus</italic>. Here, we performed an extensive effort to identify the target sequences of the parasite and assemble the genome. Several steps were applied before the final assembly was obtained, and the final assembly was compared with the <italic>Echinococcus</italic> genomes obtained previously by us (<xref ref-type="bibr" rid="B32">Maldonado et al., 2017</xref>). Even though the quality of <italic>E. oligarthrus</italic> genome assembly was lower than other <italic>Echinococcus</italic> genomes, it was high enough to locate the genes and perform comparative genomic analysis.</p>
<p>Hereby, we used these data to unravel the phylogeny of this genus. In previous studies, we described the genetic variation among three <italic>Echinococcus</italic> species (<italic>E. canadensis</italic> G7, <italic>E. granulosus</italic> G1, and <italic>E. multiloccularis</italic>) and assessed the distribution of SNPs in the whole genome as well as the effect and the type of SNPs in the coding regions. In this work, we added a new genome to the analysis of genetic variants and studied the SNP distribution in each one of their chromosomes. Regarding the genetic diversity among the <italic>Echinococcus</italic> species, we found that <italic>E. canadensis</italic> G7 and <italic>E. granulosus</italic> G1 contained almost 10 orders of magnitude more SNPs than between <italic>E. oligarthrus</italic> and <italic>E. multilocularis</italic>. The SNP distribution observed is similar to the distribution described in our previous research (<xref ref-type="bibr" rid="B32">Maldonado et al., 2017</xref>) where the genetic diversity within the <italic>E. granulosus sensu lato</italic> species was high. On the other hand, the genetic diversity between the sylvatic species <italic>E. oligarthrus</italic> and <italic>E. multilocularis</italic> is remarkably lower. Furthermore, the genetic variability showed to be unequal for different chromosomes. This fact was revealed by a higher SNP density in larger chromosomes than in the smallest ones. In previous studies, we have also reported a higher gene density in larger chromosomes (<xref ref-type="bibr" rid="B33">Maldonado et al., 2018</xref>). However, since most of the SNPs produce synonymous nucleotide changes, the amino acid sequences derived from these genes, even those located in the larger chromosomes, are not altered by the presence of such changes and presumably neither their function.</p>
<p>For several years, the mitochondrial sequences were employed to analyze the <italic>Echinococcus</italic> phylogeny (for a review, see <xref ref-type="bibr" rid="B36">Nakao et al., 2013a</xref>). However, the construction of phylogenetic trees based only on mitochondrial DNA data may be biased because it is maternally inherited and, therefore, under particular evolutionary forces that may not represent the evolutionary history for each species (<xref ref-type="bibr" rid="B30">Lymbery, 2017</xref>). Indeed, it has been suggested that nuclear sequences should be used when evaluating the phylogenetic positions of new <italic>Echinococcus</italic> isolates (<xref ref-type="bibr" rid="B47">Saarma et al., 2009</xref>). Here, we performed maximum likelihood and Bayesian phylogenetic analyses using nuclear DNA sequences and compared the results implementing different models including the best-fitted evolutionary model predicted by PartitionFinder 2 (<xref ref-type="bibr" rid="B27">Lanfear et al., 2016</xref>). For this purpose, we used all the shared loci within SNPs that were identified among the <italic>Echinococcus</italic> genomes in both whole genome and coding regions and under the strict criteria of having more than 20&#xd7; depth coverage in all the species. Hereby, and adding previously described nuclear molecular markers, which provides high accuracy to our results, the tree topology retrieved as the most frequent reconstruction placed <italic>E. oligarthrus</italic> in a basal position. Based on these analyses, we conclude that <italic>E. oligarthrus</italic> may be one of the basal species of the genus <italic>Echinococcus</italic>, together with <italic>E. multilocularis</italic>. These findings also agree with our previous studies (<xref ref-type="bibr" rid="B32">Maldonado et al., 2017</xref>) where we proposed a basal sylvatic species that could have accumulated mutations over time until a speciation phenomenon could have given rise to <italic>E. granulosus</italic> G1 and <italic>E. canadensis</italic> G7, which afterwards would have diverged, independently increasing the genetic diversity. The fact that <italic>E. canadensis</italic> G7 and <italic>E. granulosus</italic> G1 share more homozygous polymorphic loci with the same variant supports the hypothesis of a basal sylvatic species. However, since the number of homozygous polymorphic loci with the same variant shared among three of the species is almost equal (12,282 for <italic>E. multilocularis</italic> and 12,277 for <italic>E. oligarthrus</italic> as reference genomes), this result does not allow one to resolve whether <italic>E. multilocularis</italic>, <italic>E. oligarthrus</italic>, or other unknown ancestral related species is the ancestral species from which modern <italic>Echinococcus</italic> genomes could have arisen, and thus remains unclear. This hypothesis could be further probed with the complete genome analyses of more <italic>Echinococcus</italic> species, which would be really useful to describe the complete evolutionary history of these parasites. One of the most interesting implications of the nuclear phylogeny based on SNP analysis found in this work is the position of the <italic>Echinococcus</italic> sylvatic species as basal to the genus <italic>Echinococcus</italic>; in addition, it also demonstrates relevant genetic similarities between <italic>E. multilocuaris</italic> and <italic>E. oligarthrus</italic>. This evolutionary framework may enable data-driven investigation of morphological features and developmental evolutionary studies that would provide relevant information about the neotropical echinococcosis. The generation of genome datasets from additional cestode species would further improve these findings. Phylogenetic studies have allowed the development of hypotheses about the evolutionary history of several taxonomic groups from other perspectives. Such is the case of parasitic organisms and the phylogeography of their hosts that helps to interpret parasite evolution in relation to the migratory patterns of their hosts and vice versa. In the neotropical region, several felids serve as definitive hosts for <italic>E. oligarthrus</italic>. Recently, we determined for the first time the presence of <italic>E. oligarthrus</italic> in ocelot (<italic>Leopardus pardalis</italic>) and the puma (<italic>Puma concolor</italic>) in the north of Argentina using nuclear and mitochondrial molecular markers (<xref ref-type="bibr" rid="B1">Arrabal et al., 2017</xref>). Indeed, all the cases reported so far have come from South America (<xref ref-type="bibr" rid="B10">D&#x2019;Alessandro and Rausch, 2008</xref>). In terms of phylogeography, the most suitable explanation is that carnivores originate from immigrants from North America and the ancestral species of <italic>Echinococcus</italic> migrated to South America together with their felid hosts. Early studies suggested that the differentiation of the species of <italic>Leopardus</italic> was likely facilitated by the formation of the Panamanian land bridge. The current hypothesis about felid evolution based on molecular phylogenetic studies suggests that the endemic neotropical felids (genus <italic>Leopardus</italic>) have diverged from other main felid lineages and that before the emergence of the Panamanian isthmus they could have migrated to South America. (<xref ref-type="bibr" rid="B21">Johnson et al., 2006</xref>). The finding of an archaic lineage of <italic>Trichinella</italic> in South America also supports the hypothesis of this early carnivore expansion (<xref ref-type="bibr" rid="B40">Pozio et al., 2009</xref>). Although originating in North America, the Puma currently has an extensive geographic range in South America and could explain the presence of <italic>E. oligarthrus</italic> in several felid species from the neotropical region. Regarding intermediate hosts, the rodents of the <italic>Hystricomorpha</italic> suborder, natural hosts of <italic>E. oligarthrus</italic>, are known to be the dominant small terrestrial herbivores in South America by the Miocene (<xref ref-type="bibr" rid="B14">Eisenberg et al., 1989</xref>). Hence, both the neotropical <italic>Echinococcus</italic> species and their respective hosts seem to have an ancient origin (<xref ref-type="bibr" rid="B38">Nakao et al., 2013b</xref>). The need for more genomes and analyses of host&#x2013;parasite interactions are evident in order to further understand the co-evolution between this parasite and its felid host and the lifestyles of <italic>Echinococcus</italic> species.</p>
<p>Before sequencing the <italic>E. oligarthrus</italic> genome, there was a paucity of related molecular data of this organism. Indeed, only 47 nucleotide sequences have been reported so far, representing only nine genes. Most of them were obtained with the sole purpose of being used as molecular markers (e.g., 18s rRNA nuclear gene or cox1 mitochondrial gene). The scarce sequences information about this parasite was significantly improved through our effort to sequence, assemble, and annotate the genome of <italic>E. oligarthrus</italic>. Despite the fragmented nature of the assembly, we have thoroughly analyzed the gene content in comparisons with other members of the genus. Most of the core genes are identifiable in the genome even though many genes are present as fragments within the assembly. Although many genes presented coverage &lt; 50%, the GO terms distribution found for <italic>E. oligarthrus</italic> was according to that observed in other <italic>Echinococcus</italic> species. Even more, we found that many genes are typically conserved in all the cestodes species. Indeed, these genes composed the core of cestodes genes according to <xref ref-type="bibr" rid="B32">Maldonado et al. (2017)</xref>, which means that these genes can be found in all the cestodes species whose genomes have been sequenced so far. Therefore, both the genome and the gene annotations of <italic>E. oligarthrus</italic> are suitable to be used in several bioinformatic and comparative analysis as well as to guide hummed and molecular assays. Several works have reported the molecular and cellular mechanisms implicated in the larvae development of <italic>E. multilocularis</italic> and <italic>E. granulosus sensu stricto</italic> (<xref ref-type="bibr" rid="B7">Brehm and Koziol, 2017</xref>) but not much is known about the neotropical <italic>Echinococcus</italic> species that have a particular larvae morphology (<xref ref-type="bibr" rid="B10">D&#x2019;Alessandro and Rausch, 2008</xref>). By means of this new genome, we expanded the repertoire of the available genes and reached the 85% of the genes expected to be present in <italic>Echinococcus</italic>, providing for the first time a large set of proteins of a neotropical <italic>Echinococcus</italic> species that can be further studied. Until now, only two families of coding genes implicated in host&#x2013;parasite interactions had been sequenced from <italic>E. oligarthrus</italic>, the antigen Eg95 and the partial sequence of the antigen B (<xref ref-type="bibr" rid="B18">Haag et al., 2006</xref>, <xref ref-type="bibr" rid="B17">Haag et al., 2009</xref>). In this regard, we searched for proteins already known to be involved in host&#x2013;parasite interactions, including genes that are responsible for larval development and signaling pathways in <italic>Echinococcus</italic>. Genes such as Wnt or TNF&#x3b1; receptor and putative regulatory genomic sequences of thousands of important genes were described here. By means of this work, the study of the genes and genomic regulatory regions of the neotropical species <italic>E. oligarthrus</italic> are now reachable by the scientific community. The data obtained here will allow the design of data-driven experiments of gene expression that will provide clues about the particular behavior of the parasite into the mammalian hosts and its differences between sylvatic and domestic species.</p>
</sec>
<sec id="s6">
<title>Data Availability</title>
<p>The assembled sequences of the E. oligarthrus genome were deposited in ENA (BioProject PRJEB31222, <uri xlink:href="http://www.ebi.ac.uk/ena/data/view/PRJEB31222">http://www.ebi.ac.uk/ena/data/view/PRJEB31222</uri>). The sequences and the annotation data can also be downloaded from our FlatDB project web page (<uri xlink:href="http://www.bmhid.org/flatdb/">http://www.bmhid.org/flatdb/</uri>). All the data generated and analyzed in this study are included in this published article and within the <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Material</bold>
</xref>.</p>
</sec>
<sec id="s7">
<title>Author Contributions</title>
<p>LM performed the bioinformatics analysis and wrote the manuscript. JA collected the parasite material and performed the genetic and morphological analysis. LM and LK designed the study. GO constructed the libraries and performed the sequencing. LM, LK, MR, and GO wrote and revised the manuscript. All authors read and approved the manuscript.</p>
</sec>
<sec id="s8" sec-type="funding-information">
<title>Funding</title>
<p>This study was supported by the MinCyT CAPES BR/RED 1413 (L.K.), Secretaria de Pol&#xed;ticas Universitarias-Ministerio de Educaci&#xf3;n, Cultura, Ciencia y Tecnolog&#xed;a, Argentina (CAPG-BA 070/13) (L.K. and G.O.) and Sistema Nacional de Computaci&#xf3;n de Alto Desempe&#xf1;o (SNCAD-MiNCyT) (L.K.).</p>
</sec>
<sec id="s9">
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<title>Acknowledgments</title>
<p>The authors thank Gabriel Lichtenstein for server administration and maintenance.</p>
</ack>
<sec sec-type="supplementary-material" id="s10">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fgene.2019.00708/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fgene.2019.00708/full#supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="Table_1.xlsx" id="SM1" mimetype="application/xlsx"/>
<supplementary-material xlink:href="DataSheet_1.pdf" id="SM2" mimetype="application/pdf"/>
<supplementary-material xlink:href="DataSheet_2.pdf" id="SM3" mimetype="application/pdf"/>
<supplementary-material xlink:href="DataSheet_3.pdf" id="SM4" mimetype="application/pdf"/>
<supplementary-material xlink:href="DataSheet_4.pdf" id="SM5" mimetype="application/pdf"/>
<supplementary-material xlink:href="DataSheet_5.pdf" id="SM6" mimetype="application/pdf"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Arrabal</surname> <given-names>J. P.</given-names>
</name>
<name>
<surname>Avila</surname> <given-names>H. G.</given-names>
</name>
<name>
<surname>Rivero</surname> <given-names>M. R.</given-names>
</name>
<name>
<surname>Camicia</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Salas</surname> <given-names>M. M.</given-names>
</name>
<name>
<surname>Costa</surname> <given-names>S. A.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>
<italic>Echinococcus oligarthrus</italic> in the subtropical region of Argentina: first integration of morphological and molecular analyses determines two distinct populations</article-title>. <source>Vet. Parasitol.</source> <volume>240</volume>, <fpage>60</fpage>&#x2013;<lpage>67</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.vetpar.2017.03.019</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ashburner</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Ball</surname> <given-names>C. A.</given-names>
</name>
<name>
<surname>Blake</surname> <given-names>J. A.</given-names>
</name>
<name>
<surname>Botstein</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Butler</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Cherry</surname> <given-names>J. M.</given-names>
</name>
<etal/>
</person-group>. (<year>2000</year>). <article-title>Gene Ontology: tool for the unification of biology</article-title>. <source>Nat. Genet.</source> <volume>25</volume>, <fpage>25</fpage>&#x2013;<lpage>29</lpage>. doi: <pub-id pub-id-type="doi">10.1038/75556</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bankevich</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Nurk</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Antipov</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Gurevich</surname> <given-names>A. A.</given-names>
</name>
<name>
<surname>Dvorkin</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Kulikov</surname> <given-names>A. S.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing</article-title>. <source>J. Comput. Biol.</source> <volume>19</volume>, <fpage>455</fpage>&#x2013;<lpage>477</lpage>. doi: <pub-id pub-id-type="doi">10.1089/&#xad;cmb.2012.0021</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barnett</surname> <given-names>D. W.</given-names>
</name>
<name>
<surname>Garrison</surname> <given-names>E. K.</given-names>
</name>
<name>
<surname>Quinlan</surname> <given-names>A. R.</given-names>
</name>
<name>
<surname>Str&#xf6;mberg</surname> <given-names>M. P.</given-names>
</name>
<name>
<surname>Marth</surname> <given-names>G. T.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>BamTools: a C++ API and toolkit for analyzing and managing BAM files</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>1691</fpage>&#x2013;<lpage>1692</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btr174</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Birney</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Clamp</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Durbin</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>GeneWise and genomewise</article-title>. <source>Genome Res.</source> <volume>14</volume>, <fpage>988</fpage>. doi: <pub-id pub-id-type="doi">10.1101/gr.1865504</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bolger</surname> <given-names>A. M.</given-names>
</name>
<name>
<surname>Lohse</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Usadel</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Trimmomatic: a flexible trimmer for Illumina sequence data</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>2114</fpage>&#x2013;<lpage>2120</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btu170</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brehm</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Koziol</surname> <given-names>U.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>
<italic>Echinococcus</italic>&#x2013;host interactions at cellular and molecular levels</article-title>. <source>Adv. Parasitol.</source> <volume>95</volume>, <fpage>147</fpage>&#x2013;<lpage>212</lpage>. doi: <pub-id pub-id-type="doi">10.1016/bs.apar.2016.09.001</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Buchfink</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Xie</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Huson</surname> <given-names>D. H.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Fast and sensitive protein alignment using DIAMOND</article-title>. <source>Nat. Methods</source> <volume>12</volume>, <fpage>59</fpage>&#x2013;<lpage>60</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nmeth.3176</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cingolani</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Platts</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>L. L.</given-names>
</name>
<name>
<surname>Coon</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Nguyen</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>L.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of <italic>Drosophila melanogaster</italic> strain w1118; iso-2; iso-3</article-title>. <source>Fly (Austin).</source> <volume>6</volume>, <fpage>80</fpage>&#x2013;<lpage>92</lpage>. doi: <pub-id pub-id-type="doi">10.4161/fly.19695</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>D&#x2019;Alessandro</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Rausch</surname> <given-names>R. L.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>New aspects of neotropical polycystic (<italic>Echinococcus vogeli</italic>) and unicystic (<italic>Echinococcus oligarthrus</italic>) echinococcosis</article-title>. <source>Clin. Microbiol. Rev.</source> <volume>21</volume>, <fpage>380</fpage>&#x2013;<lpage>401</lpage>. doi: <pub-id pub-id-type="doi">10.1128/CMR.00050-07</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Danecek</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Auton</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Abecasis</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Albers</surname> <given-names>C. A.</given-names>
</name>
<name>
<surname>Banks</surname> <given-names>E.</given-names>
</name>
<name>
<surname>DePristo</surname> <given-names>M. A.</given-names>
</name>
<etal/>
</person-group>. (<year>2011</year>). <article-title>The variant call format and VCFtools</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>2156</fpage>&#x2013;<lpage>2158</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btr330</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Danecek</surname> <given-names>P.</given-names>
</name>
<name>
<surname>McCarthy</surname> <given-names>S. A.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>BCFtools/csq: haplotype-aware variant consequences</article-title>. <source>Bioinformatics</source> <volume>33</volume>, <fpage>2037</fpage>&#x2013;<lpage>2039</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btx100</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>De Baets</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Dentzien-Dias</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Upeniece</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Verneau</surname> <given-names>O.</given-names>
</name>
<name>
<surname>Donoghue</surname> <given-names>P. C. J.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Constraining the deep origin of parasitic flatworms and host-interactions with fossil evidence</article-title>. <source>Adv. Parasitol.</source>, <volume>90</volume>, <fpage>93</fpage>&#x2013;<lpage>135</lpage>. doi: <pub-id pub-id-type="doi">10.1016/bs.apar.2015.06.002</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Eisenberg</surname> <given-names>J. F.</given-names>
</name>
<name>
<surname>Reid</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Redford</surname> <given-names>K. H.</given-names>
</name>
<name>
<surname>Bonner</surname> <given-names>S. J.</given-names>
</name>
</person-group>, (<year>1989</year>). <source>Mammals of the Neotropics</source>. <publisher-name>University of Chicago Press</publisher-name>, <publisher-loc>Chicago</publisher-loc>.</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Finn</surname> <given-names>R. D.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Pfam: Clans, web tools and services</article-title>. <source>Nucleic Acids Res.</source> <volume>34</volume>, <fpage>D247</fpage>&#x2013;<lpage>D251</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkj149</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gurevich</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Saveliev</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Vyahhi</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Tesler</surname> <given-names>G.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>QUAST: quality assessment tool for genome assemblies</article-title>. <source>Bioinformatics</source> <volume>29</volume>, <fpage>1072</fpage>&#x2013;<lpage>1075</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btt086</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Haag</surname> <given-names>K. L.</given-names>
</name>
<name>
<surname>Gottstein</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Ayala</surname> <given-names>F. J.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>The EG95 antigen of <italic>Echinococcus</italic> spp. contains positively selected amino acids, which may influence host specificity and vaccine efficacy</article-title>. <source>PLoS One</source> <elocation-id>e5362</elocation-id>, <fpage>4</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0005362</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Haag</surname> <given-names>K. L.</given-names>
</name>
<name>
<surname>Zanotto</surname> <given-names>P. M. A.</given-names>
</name>
<name>
<surname>Alves-Junior</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Gasser</surname> <given-names>R. B.</given-names>
</name>
<name>
<surname>Zaha</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Ayala</surname> <given-names>F. J.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Searching for antigen B genes and their adaptive sites in distinct strains and species of the helminth <italic>Echinococcus</italic>
</article-title>. <source>Infect. Genet. Evol.</source> <volume>6</volume>, <fpage>251</fpage>&#x2013;<lpage>261</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.meegid.2005.07.003</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hahn</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Fromm</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Bachmann</surname> <given-names>L.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Comparative genomics of flatworms (Platyhelminthes) reveals shared genomic features of ecto- and endoparastic neodermata</article-title>. <source>Genome Biol. Evol.</source> <volume>6</volume>, <fpage>1105</fpage>&#x2013;<lpage>1117</lpage>. doi: <pub-id pub-id-type="doi">10.1093/gbe/evu078</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>H&#xfc;ttner</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Nakao</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Wassermann</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Siefert</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Boomker</surname> <given-names>J. D. F.</given-names>
</name>
<name>
<surname>Dinkel</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2008</year>). <article-title>Genetic characterization and phylogenetic position of <italic>Echinococcus felidis</italic> Ortlepp, 1937 (Cestoda: Taeniidae) from the African lion</article-title>. <source>Int. J. Parasitol.</source> <volume>38</volume>, <fpage>861</fpage>&#x2013;<lpage>868</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.ijpara.2007.10.013</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Johnson</surname> <given-names>W. E.</given-names>
</name>
<name>
<surname>Eizirik</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Pecon-Slattery</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Murphy</surname> <given-names>W. J.</given-names>
</name>
<name>
<surname>Antunes</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Teeling</surname> <given-names>E.</given-names>
</name>
<etal/>
</person-group>. (<year>2006</year>). <article-title>The late Miocene radiation of modern Felidae: a genetic assessment</article-title>. <source>Science</source> <volume>311</volume>, <fpage>73</fpage>&#x2013;<lpage>77</lpage>. doi: <pub-id pub-id-type="doi">10.1126/science.1122277</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Keibler</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Brent</surname> <given-names>M. R.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>Eval: a software package for analysis of genome annotations</article-title>. <source>BMC Bioinform.</source> <volume>4</volume>, <elocation-id>50</elocation-id>. doi: <pub-id pub-id-type="doi">10.1186/1471-2105-4-50</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kinkar</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Laurim&#xe4;e</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Sharbatkhori</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Mirhendi</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Kia</surname> <given-names>E. B.</given-names>
</name>
<name>
<surname>Ponce-Gordo</surname> <given-names>F.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>New mitogenome and nuclear evidence on the phylogeny and taxonomy of the highly zoonotic tapeworm <italic>Echinococcus granulosus</italic> sensu stricto</article-title>. <source>Infect. Genet. Evol.</source> <volume>52</volume>, <fpage>52</fpage>&#x2013;<lpage>58</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.meegid.2017.04.023</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Knapp</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Nakao</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Yanagida</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Okamoto</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Saarma</surname> <given-names>U.</given-names>
</name>
<name>
<surname>Lavikainen</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2011</year>). <article-title>Phylogenetic relationships within <italic>Echinococcus</italic> and <italic>Taenia</italic> tapeworms (Cestoda: Taeniidae): an inference from nuclear protein-coding genes</article-title>. <source>Mol. Phylogenet. Evol.</source> <volume>61</volume>, <fpage>628</fpage>&#x2013;<lpage>638</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.ympev.2011.07.022</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Stecher</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Knyaz</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Tamura</surname> <given-names>K.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>MEGA X: Molecular evolutionary genetics analysis across computing platforms</article-title>. <source>Mol. Biol. Evol.</source> <volume>35</volume>, <fpage>1547</fpage>&#x2013;<lpage>1549</lpage>. doi: <pub-id pub-id-type="doi">10.1093/molbev/msy096</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Laetsch</surname> <given-names>D. R.</given-names>
</name>
<name>
<surname>Blaxter</surname> <given-names>M. L.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>BlobTools: interrogation of genome assemblies</article-title>. <source>F1000Research</source> <volume>6</volume>, <fpage>1287</fpage>. doi: <pub-id pub-id-type="doi">10.12688/f1000research.12232.1</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lanfear</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Frandsen</surname> <given-names>P. B.</given-names>
</name>
<name>
<surname>Wright</surname> <given-names>A. M.</given-names>
</name>
<name>
<surname>Senfeld</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Calcott</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>PartitionFinder 2: new methods for selecting partitioned models of evolution for molecular and morphological phylogenetic analyses</article-title>. <source>Mol. Biol. Evol.</source> <volume>34</volume>, <elocation-id>msw260</elocation-id>. doi: <pub-id pub-id-type="doi">10.1093/molbev/msw260</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Langmead</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Salzberg</surname> <given-names>S. L.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Fast gapped-read alignment with Bowtie 2</article-title>. <source>Nat. Methods</source> <volume>9</volume>, <fpage>357</fpage>&#x2013;<lpage>359</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nmeth.1923</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Durbin</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Fast and accurate short read alignment with Burrows&#x2013;Wheeler transform</article-title>. <source>Bioinformatics</source> <volume>25</volume>, <fpage>1754</fpage>&#x2013;<lpage>1760</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btp324</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lymbery</surname> <given-names>A. J.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Phylogenetic pattern, evolutionary processes and species delimitation in the genus <italic>Echinococcus</italic>
</article-title>. <source>Adv. Parasitol.</source>, <volume>95</volume>, <fpage>111</fpage>&#x2013;<lpage>145</lpage>. doi: <pub-id pub-id-type="doi">10.1016/bs.apar.2016.07.002</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lymbery</surname> <given-names>A. J.</given-names>
</name>
<name>
<surname>Jenkins</surname> <given-names>E. J.</given-names>
</name>
<name>
<surname>Schurer</surname> <given-names>J. M.</given-names>
</name>
<name>
<surname>Thompson</surname> <given-names>R. C. A.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>
<italic>Echinococcus canadensis</italic>, <italic>E. borealis</italic>, and <italic>E. intermedius</italic>
</article-title>. <source>Trends Parasitol.</source> <volume>31</volume>, <fpage>23</fpage>&#x2013;<lpage>29</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.pt.2014.11.003</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maldonado</surname> <given-names>L. L.</given-names>
</name>
<name>
<surname>Assis</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Ara&#xfa;jo</surname> <given-names>F. M. G.</given-names>
</name>
<name>
<surname>Salim</surname> <given-names>A. C. M.</given-names>
</name>
<name>
<surname>Macchiaroli</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Cucher</surname> <given-names>M.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>The <italic>Echinococcus canadensis</italic> (G7) genome: a key knowledge of parasitic platyhelminth human diseases</article-title>. <source>BMC Genomics</source> <volume>18</volume>, <fpage>204</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12864-017-3574-0</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maldonado</surname> <given-names>L. L.</given-names>
</name>
<name>
<surname>Stegmayer</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Milone</surname> <given-names>D. H.</given-names>
</name>
<name>
<surname>Oliveira</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Rosenzvit</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Kamenetzky</surname> <given-names>L.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Whole genome analysis of codon usage in</article-title>. <source>Echinococcus. Mol. Biochem. Parasitol.</source> <volume>225</volume>, <fpage>54</fpage>&#x2013;<lpage>56</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.molbiopara.2018.08.001</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McKenna</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Hanna</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Banks</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Sivachenko</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Cibulskis</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Kernytsky</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2010</year>). <article-title>The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data</article-title>. <source>Genome Res.</source> <volume>20</volume>, <fpage>1297</fpage>&#x2013;<lpage>1303</lpage>. doi: <pub-id pub-id-type="doi">10.1101/gr.107524.110</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McManus</surname> <given-names>D. P.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Current status of the genetics and molecular taxonomy of <italic>Echinococcus</italic> species</article-title>. <source>Parasitology</source> <volume>140</volume>, <fpage>1617</fpage>&#x2013;<lpage>1623</lpage>. doi: <pub-id pub-id-type="doi">10.1017/S0031182013000802</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nakao</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Lavikainen</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Yanagida</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Ito</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2013</year>a). <article-title>Phylogenetic systematics of the genus <italic>Echinococcus</italic> (Cestoda: Taeniidae)</article-title>. <source>Int. J. Parasitol.</source> <volume>43</volume>, <fpage>1017</fpage>&#x2013;<lpage>1029</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.ijpara.2013.06.002</pub-id>
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nakao</surname> <given-names>M.</given-names>
</name>
<name>
<surname>McManus</surname> <given-names>D. P.</given-names>
</name>
<name>
<surname>Schantz</surname> <given-names>P. M.</given-names>
</name>
<name>
<surname>Craig</surname> <given-names>P. S.</given-names>
</name>
<name>
<surname>Ito</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>A molecular phylogeny of the genus <italic>Echinococcus</italic> inferred from complete mitochondrial genomes</article-title>. <source>Parasitology</source> <volume>134</volume>, <fpage>713</fpage>&#x2013;<lpage>722</lpage>. doi: <pub-id pub-id-type="doi">10.1017/S0031182006001934</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nakao</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Yanagida</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Konyaev</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Lavikainen</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Odnokurtsev</surname> <given-names>V. A.</given-names>
</name>
<name>
<surname>Zaikov</surname> <given-names>V. A.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>b). <article-title>Mitochondrial phylogeny of the genus <italic>Echinococcus</italic> (Cestoda: Taeniidae) with emphasis on relationships among <italic>Echinococcus canadensis</italic> genotypes</article-title>. <source>Parasitology</source> <volume>140</volume>, <fpage>1625</fpage>&#x2013;<lpage>1636</lpage>. doi: <pub-id pub-id-type="doi">10.1017/S0031182013000565</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Narasimhan</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Danecek</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Scally</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Xue</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Tyler-Smith</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Durbin</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>BCFtools/RoH: a hidden Markov model approach for detecting autozygosity from next-generation sequencing data</article-title>. <source>Bioinformatics</source> <volume>32</volume>, <fpage>1749</fpage>&#x2013;<lpage>1751</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btw044</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pozio</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Hoberg</surname> <given-names>E.</given-names>
</name>
<name>
<surname>La Rosa</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Zarlenga</surname> <given-names>D. S.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Molecular taxonomy, phylogeny and biogeography of nematodes belonging to the <italic>Trichinella</italic> genus</article-title>. <source>Infect. Genet. Evol.</source> <volume>9</volume>, <fpage>606</fpage>&#x2013;<lpage>616</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.meegid.2009.03.003</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Quevillon</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Silventoinen</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Pillai</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Harte</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Mulder</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Apweiler</surname> <given-names>R.</given-names>
</name>
<etal/>
</person-group>. (<year>2005</year>). <article-title>InterProScan: protein domains identifier</article-title>. <source>Nucleic Acids Res.</source> <volume>33</volume>, <fpage>W116</fpage>&#x2013;<lpage>W120</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gki442</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Quinlan</surname> <given-names>A. R.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>BEDTools: the Swiss-Army tool for genome feature analysis</article-title>. <source>Curr. Protoc. Bioinforma.</source> <volume>47</volume>, <page-range>11.12.1&#x2013;34</page-range>. doi: <pub-id pub-id-type="doi">10.1002/0471250953.bi1112s47</pub-id>
</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Quinlan</surname> <given-names>A. R.</given-names>
</name>
<name>
<surname>Hall</surname> <given-names>I. M.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>BEDTools: a flexible suite of utilities for comparing genomic features</article-title>. <source>Bioinformatics</source> <volume>26</volume>, <fpage>841</fpage>&#x2013;<lpage>842</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btq033</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rausch</surname> <given-names>R. L.</given-names>
</name>
<name>
<surname>Journal</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Jun</surname> <given-names>N.</given-names>
</name>
</person-group> (<year>1967</year>). <article-title>A consideration of infraspecific categories in the genus <italic>Echinococcus</italic> Rudolphi, 1801 (Cestoda&#x202f;: Taeniidae)</article-title>. <source>Genus</source> <volume>53</volume>, <fpage>484</fpage>&#x2013;<lpage>491</lpage>. doi: <pub-id pub-id-type="doi">10.2307/3276706</pub-id>
</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rice</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Longden</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Bleasby</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>EMBOSS: The European Molecular Biology Open Software Suite</article-title>. <source>Trends Genet.</source> <volume>16</volume>, <fpage>276</fpage>&#x2013;<lpage>277</lpage>. doi: <pub-id pub-id-type="doi">10.1016/S0168-9525(00)02024-2</pub-id>
</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ronquist</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Teslenko</surname> <given-names>M.</given-names>
</name>
<name>
<surname>van der Mark</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Ayres</surname> <given-names>D. L.</given-names>
</name>
<name>
<surname>Darling</surname> <given-names>A.</given-names>
</name>
<name>
<surname>H&#xf6;hna</surname> <given-names>S.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space</article-title>. <source>Syst. Biol.</source> <volume>61</volume>, <fpage>539</fpage>&#x2013;<lpage>542</lpage>. doi: <pub-id pub-id-type="doi">10.1093/sysbio/sys029</pub-id>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saarma</surname> <given-names>U.</given-names>
</name>
<name>
<surname>J&#xf5;gisalu</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Moks</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Varcasia</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Lavikainen</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Oksanen</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2009</year>). <article-title>A novel phylogeny for the genus <italic>Echinococcus</italic>, based on nuclear data, challenges relationships based on mitochondrial evidence</article-title>. <source>Parasitology</source> <volume>136</volume>, <fpage>317</fpage>&#x2013;<lpage>328</lpage>. doi: <pub-id pub-id-type="doi">10.1017/S0031182008005453</pub-id>
</citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Safonova</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Bankevich</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Pevzner</surname> <given-names>P. A.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>DIPSPADES: Assembler for highly polymorphic diploid genomes</article-title>. <source>J Comput Biol.</source> <volume>22</volume>(<issue>6</issue>): <fpage>528</fpage>&#x2013;<lpage>545</lpage>. doi: <pub-id pub-id-type="doi">10.1007/978-3-319-05269-4_21</pub-id>
</citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sim&#xe3;o</surname> <given-names>F. A.</given-names>
</name>
<name>
<surname>Waterhouse</surname> <given-names>R. M.</given-names>
</name>
<name>
<surname>Ioannidis</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Kriventseva</surname> <given-names>E. V.</given-names>
</name>
<name>
<surname>Zdobnov</surname> <given-names>E. M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs</article-title>. <source>Bioinformatics</source> <volume>31</volume>, <fpage>3210</fpage>&#x2013;<lpage>3212</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btv351</pub-id>
</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Slater</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Birney</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Box</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Smith</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Waterman</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Altschul</surname> <given-names>S.</given-names>
</name>
<etal/>
</person-group>. (<year>2005</year>). <article-title>Automated generation of heuristics for biological sequence comparison</article-title>. <source>BMC Bioinform.</source> <volume>6</volume>, <elocation-id>31</elocation-id>. doi: <pub-id pub-id-type="doi">10.1186/1471-2105-6-31</pub-id>
</citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Soares</surname> <given-names>M. do, C. P.</given-names>
</name>
<name>
<surname>Rodrigues</surname> <given-names>A. L. dos S.</given-names>
</name>
<name>
<surname>Moreira Silva</surname> <given-names>C. A.</given-names>
</name>
<name>
<surname>Brito</surname> <given-names>E. M. de F.</given-names>
</name>
<name>
<surname>Gomes-Gouv&#xea;a</surname> <given-names>M. S.</given-names>
</name>
<name>
<surname>Corr&#xea;a</surname> <given-names>I. R. dos S.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>). <article-title>Anatomo-clinical and molecular description of liver neotropical echinococcosis caused by <italic>Echinococcus oligarthrus</italic> in human host</article-title>. <source>Acta Trop.</source> <volume>125</volume>, <fpage>110</fpage>&#x2013;<lpage>114</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.actatropica.2012.09.004</pub-id>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stemmer</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Burghardt</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Mayer</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Reinicke</surname> <given-names>G. B.</given-names>
</name>
<name>
<surname>W&#xe4;gele</surname> <given-names>H.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>). <article-title>Morphological and genetic analyses of xeniid soft coral diversity (Octocorallia; Alcyonacea)</article-title>. <source>Org. Divers. Evol.</source> <volume>13</volume>, <fpage>135</fpage>&#x2013;<lpage>150</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s13127-012-0119-x</pub-id>
</citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tamazian</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Dobrynin</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Krasheninnikova</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Komissarov</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Koepfli</surname> <given-names>K.-P.</given-names>
</name>
<name>
<surname>O&#x2019;Brien</surname> <given-names>S. J.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Chromosomer: a reference-based genome arrangement tool for producing draft chromosome sequences</article-title>. <source>Gigascience</source> <volume>5</volume>, <fpage>38</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s13742-016-0141-6</pub-id>
</citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tsai</surname> <given-names>I. J.</given-names>
</name>
<name>
<surname>Zarowiecki</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Holroyd</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Garciarrubio</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Sanchez-Flores</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Brooks</surname> <given-names>K. L.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>). <article-title>The genomes of four tapeworm species reveal adaptations to parasitism</article-title>. <source>Nature</source> <volume>496</volume>, <fpage>57</fpage>&#x2013;<lpage>63</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nature12031</pub-id>
</citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Verster</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>1969</year>). <article-title>A taxonomic revision of the genus Taenia Linnaeus, 1758 s. str</article-title>. <source>J. Vet. Res.</source> <volume>36</volume> (<issue>1</issue>), <fpage>3</fpage>-<lpage>58</lpage>. <uri xlink:href="http://hdl.handle.net/2263/53847">http://hdl.handle.net/2263/53847</uri>.</citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xiao</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Qiu</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Nakao</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>X.</given-names>
</name>
<etal/>
</person-group>. (<year>2006</year>). <article-title>
<italic>Echinococcus shiquicus</italic>, a new species from the Qinghai&#x2013;Tibet plateau region of China: Discovery and epidemiological implications</article-title>. <source>Parasitol. Int.</source> <volume>55</volume>, <fpage>S233</fpage>&#x2013;<lpage>S236</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.parint.2005.11.035</pub-id>
</citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zheng</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Lu</surname> <given-names>G.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>). <article-title>The genome of the hydatid tapeworm <italic>Echinococcus granulosus</italic>
</article-title>. <source>Nat. Genet.</source> <volume>45</volume>, <fpage>1168</fpage>&#x2013;<lpage>1175</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ng.2757</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>