BRIEF RESEARCH REPORT article

Front. Genet., 09 February 2021

Sec. Computational Genomics

Volume 12 - 2021 | https://doi.org/10.3389/fgene.2021.625607

SARS-CoV-2 Early Screening at the Point of Entry: Travelers From Bangladesh to Italy–July 2020

  • National Institute for Infectious Diseases, INMI (Istituto Nazionale per le Malattie Infettive), “Lazzaro Spallanzani” IRCCS (Istituto di ricovero e Cura a Carattere Scientifico), Rome, Italy

Abstract

We report phylogenetic and mutational analysis by NGS of six SARS-CoV-2 strains from patients flying from Bangladesh to Italy (July 2020). Data suggest that no further circulation of such imported strains occurred in Italy, stating the efficacy of early screening at the point of entry and supporting the importance of molecular epidemiology in monitoring the efficacy of control measures.

Introduction

The current outbreak of novel coronavirus (COVID-19) disease has spread across borders through travelers. Thanks to the lockdown measures, closure of unnecessary activities and services, as well as block of traveling from foreign countries was undertaken in Italy from 9th March to 3rd May. As a result, partial control of virus spread was achieved, and subsequently the country gradually returned activities to normal, including travel connections with other countries (ECDC., 2020). In the release phase, early detection of suspect cases at points of entry (POE), including ports, airports and ground crossings and implementation of appropriate control measures are crucial to reduce the risk for igniting new transmission chains (Alm et al., 2020; ECDC., 2020).

Here we report the phylogenetic and mutational analysis of SARS-CoV-2 strains harbored by travelers entering Italy from Bangladesh flying from Dhaka in early July 2020. The analyses were carried out on six samples randomly selected, and the data supported the importance of molecular epidemiology in achieving successful control of new infection waves from imported cases.

Method

Nasopharyngeal-swabs (NPS) from 406 travelers coming from Dhaka on 2 airplanes landed in Rome Fiumicino airport were collected immediately upon disembarkment on July 7th, and sent to the Laboratory of Virology of the “L. Spallanzani” Institute, Rome, for SARS-Cov-2 diagnosis, resulting in 50 laboratory confirmed infections.

The presence of SARS-CoV-2 RNA in clinical samples was detected by a commercial RT-PCR assay [Cobas® SARS-CoV-2 (Roche Diagnostics)].

For sequence analysis, the full genome viral sequencing was performed for available residual samples from 6 patients involved in this outbreak. Nucleic acid extraction was performed by QiaSymphony automatic extractor, then Next Generation Sequencing (NGS) was carried out on Ion Torrent GSS5 platform using Ion AmpliSeq SARS-CoV-2 Panel, following manufacturer's instructions (ThermoFisher). Ethical approval for sequence analysis: no. 70/2018(17/12/2018).

Mean quality Phred score >20 raw reads were selected and trimmed using Trimmomatic software v.0.36 (Bolger et al., 2014). SARS-CoV-2 genomes were assembled using reference-based assembly method, with BWA v.0.7.12 (Li and Durbin, 2009) and Samtools v.1.3.1 (Li et al., 2009). Contigs were then verified using Geneious 2019.2.3. Single Nucleotide Variants (SNV) were called taking all mutations with a coverage ≥50 reads and a frequency >50%, and excluding mutations lying only in the first or last 5 nucleotides of the reads.

Sequences of SARS-CoV-2 strains from Italy and Bangladesh available at 5th October 2020 were retrieved from GISAID, selecting high coverage genomes. Sequences were clustered at 0.03% using CD-HIT v.4.6 software (Fu et al., 2012). Maximum likelyhood phylogenetic analysis was performed with IQ-TREE v.1.6.12, using Transition with invariable sites plus discrete Gamma model (TIM2+I+G) and 1,000 replicates; Wuhan-Hu-1 strain was adopted as phylogenetic outgroup (MN908947.3).

Results

The phylogenetic lineage classification proposed by Rambaut et al. (2020) was used in the phylogenetic analysis, although maintaining, for comparison to previously published reports, also reference to clades reported in GISAID (Elbe and Buckland-Merrett, 2017).

As can be seen in Figure 1, most sequences reported from Bangladesh, retrieved from the GISAID platform, form distinct clusters within B1 and B1.1 (G and GR) clades.

Figure 1

Concerning the study sequences from passengers flying from Bangladesh, all belong to B1.1 (GR) clade; 5 out of 6 of them fall within the Bangladesh-specific GR cluster highlighted by the phylogenetic analysis; the remaining sequence (sequence number = 3) is interspersed with other GR sequences of mixed origin, among which there are also sequences obtained in Italy in the same period.

Thirty-eight Single Nucleotide Variants (SNV), as compared to the reference strain Wuhan (Accession Number: MN908947), were observed in the study sequences: three in the Untranslated Regions (UTR), 15 synonimous and 20 non-synonimous (Table 1).

Table 1

Regionnt PositionRef. *Patient 1Patient 2Patient 3Patient 4Patient 5Patient 6AA change
5′UTR241CTTTTTT–
Orf1ab829CTSyn
1,163ATTTTTI300F
1,202AGN313D
3,037CTTTTTTSyn
3,403GTSyn
4,331CTSyn
5,281CTSyn
5,800GTL1845F
5,869CTSyn
6,019AGSyn
7,743GTS2493I
8,026AGSyn
8,560TCSyn
9,136GTM2957I
10,626CTA3454V
11,083GTL3606F
11,719GASyn
14,408CTTTTTTP4715L
15,656CTT5131I
15,807AGSyn
17,427GTSyn
19,656GTK6464N
S23,403AGGGGGGD614G
25,046CTP1162S
25,047CAP1162C
25,337GTD1259Y
Orf3a25,907GTG172V
Orf7a27,684CTSyn
27,707CTA105V
N28,083GTSyn
28,881GAAAAAAR203K
28,882GAAAAAAR203K
28,883GCCCCCCG204R
29,272CTSyn
29,348GTS359S
3'UTR29,774CT–
29,779GT–

Consensus sequences of study samples: differences vs. Wuhan-Hu-1sequence.

*

Nucleotide positions are referred to Wuhan-Hu-1(reference genome MN908947).

Nt, nucleotide; AA, amino acid; Syn, synonymous substitution; utr, untranslated region; Orf, open reading frame.

More in details, all the strains carry a common set of 7 SNVs: C241T in 5′UTR and C3037T synonymous substitution in ORF1ab, that are the two most abundant mutations found in Bangladesh sequences and often found simultaneously, according to Ahmed Shishir et al. (2020). 14408 C>T and 23403 A>G are two additional non-synonimous mutations, often found simultaneously, leading to P4715L in ORF1ab and D614G changes in Spike protein (the signature mutation for G clade). Finally, the SNVs 28881 G>A, 28882 G>A and 28883 G>C, leading to R203K and G204R changes in Nucleocapsid protein.

Interestingly the non-synonimous SNV 1163 A>T (I300F) in ORF1 is detected in all sequences here described except in patient 3, that is not included in the Bangladesh-specific GR cluster.

Discussion

As emerged from a previous study, a high percentage of virus sequences isolated in India and Bangladesh are closely related to European and US sequences carrying the mutations typical of the G clade (D614G in S and P4715L in Nsp12) (Islam et al., 2020). Moreover, the sequences reported here carry some of the mutations highly prevalent in Bangladesh sequences available on GISAID, among which the I300F mutation (Ahmed Shishir et al., 2020). This is found to affect the structural stability of Nsp2 (metyltransferase like domain), modulating host cell survival strategy (Ahmed Shishir et al., 2020), and deserves further attention.

Since the phylogenetic tree includes all the sequences available by the 5th October from Bangladesh and Italy (0.2% of positive cases in Italy were sequenced at that time), the confinement of most (n = 5) sequences within the Bangladesh-specific GR cluster suggests effective containment and despite a limit of this study is represented by the limited number of genomes analyzed (12% of positive cases detected in two flights), no further circulation of virus imported from this country occurred after its importation on 7th of July to Rome.

In a context where travel and business relationships between different countries/continents favor the spread of the virus, sequencing and phylogenetic analysis allows to clearly recognize the cluster of imported sequences, showing strong genetic links with other sequences from the country of origin, and no further circulation in the country of destination. Therefore, this data show how sequencing of whole-genome of SARS-CoV-2 and phylogenetic analysis are of great support to molecular epidemiology. These results additionally may provide accurate information about the fate of imported viral strains of the virus, hence inform about the efficacy of implemented control measures.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: https://www.gisaid.org/, EPI_ISL_590693; https://www.gisaid.org/, EPI_ISL_590694; https://www.gisaid.org/, EPI_ISL_590695; https://www.gisaid.org/, EPI_ISL_590696; https://www.gisaid.org/, EPI_ISL_590697; https://www.gisaid.org/, EPI_ISL_590698.

Ethics statement

The studies involving human participants were reviewed and approved by ethics committee of INMI (Ethical approval: no. 70/2018(17/12/2018). Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

BB and MR: conceptualization. MR, MV, and EL: methodology. CG and FM: software. CG and EG: formal analysis. MR: investigation. FV and SL: resources. MR: writing—original draft preparation. BB and MC: writing—review and editing. CG: visualization. BB: supervision. MC and AD: project administration and funding acquisition. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by funds to the Istituto Nazionale per le Malattie Infettive (INMI) Lazzaro Spallanzani IRCCS, Rome, Italy, from the Ministero della Salute (Ricerca Corrente, linea 1; COVID-2020-12371817), the European Commission–Horizon 2020 (EU project 101003544–CoNVat; EU project 101005075-KRONO) and the European Virus Archive–GLOBAL (Grants Nos. 653316 and 871029).

Acknowledgments

We gratefully acknowledge the contributors of genome sequences of the newly emerging coronavirus, i.e., the Originating and Submitting Laboratories, for sharing their sequences and other metadata through the GISAID Initiative.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  • 1

    Ahmed ShishirT.Bin NaserI.FaruqueS. M. (2020). In silico comparative genomics of SARS-CoV-2 to determine the source and diversity of the pathogen in Bangladesh. bioRxiv.10.1101/2020.07.20.212563

  • 2

    AlmE.BrobergE. K.ConnorT.HodcroftE. B.KomissarovA. B.Maurer-StrohS.et al. (2020). Geographical and temporal distribution of SARS-CoV-2 clades in the WHO European region, January to June 2020. Euro Surveill. 25:2001410. 10.2807/1560-7917.ES.2020.25.32.2001410

  • 3

    BolgerA. M.LohseM.UsadelB. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics30, 2114–2120. 10.1093/bioinformatics/btu170

  • 4

    ECDC. (2020). Coronavirus Disease 2019 (COVID-19) in the EU/EEA and the UK – Tenth Update What is New in This Update? What Are the Risks Being Assessed in this Update? European Centre for Disease Prevention and Control.

  • 5

    ElbeS.Buckland-MerrettG. (2017). Data, disease and diplomacy: GISAID's innovative contribution to global health. Glob Challen. 1, 33–46. 10.1002/gch2.1018

  • 6

    FuL.NiuB.ZhuZ.WuS.LiW. (2012). CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics28, 3150–3152. 10.1093/bioinformatics/bts565

  • 7

    IslamO. K.Al-EmranH. M.HasanM. S.AnwarA.JahidM. I. K.HossainM. A. (2020). Emergence of European and North American mutant variants of SARS-CoV-2 in South-East Asia. Transbound Emerg Dis.1–9. 10.1111/tbed.13748

  • 8

    LiH.DurbinR. (2009). Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics25, 1754–1760. 10.1093/bioinformatics/btp324

  • 9

    LiH.HandsakerB.WysokerA.FennellT.RuanJ.HomerN.et al. (2009). The sequence alignment/map format and SAMtools. Bioinformatics25, 2078–2079. 10.1093/bioinformatics/btp352

  • 10

    RambautA.HolmesE. C.HillV.OTooleA.McCroneJ.RuisC.et al. (2020). A dynamic nomenclature proposal for SARS-CoV-2 to assist genomic epidemiology. bioRxiv. 10.1101/2020.04.17.046086

Summary

Keywords

SARS-CoV-2, next generation genome sequencing, mutations, phylogenetic analysis, COVID-19, molecular epidemiology, early detection at point of entry

Citation

Rueca M, Di Caro A, Gruber CEM, Messina F, Giombini E, Valli MB, Lalle E, Lanini S, Vairo F, Capobianchi MR and Bartolini B (2021) SARS-CoV-2 Early Screening at the Point of Entry: Travelers From Bangladesh to Italy–July 2020. Front. Genet. 12:625607. doi: 10.3389/fgene.2021.625607

Received

03 November 2020

Accepted

18 January 2021

Published

09 February 2021

Volume

12 - 2021

Edited by

Sheikh A. Rahman, Emory University, United States

Reviewed by

Najmul Haider, Royal Veterinary College (RVC), United Kingdom; Mahmuda Yasmin, University of Dhaka, Bangladesh; Rebecca Rockett, The University of Sydney, Australia

Updates

Copyright

*Correspondence: Maria Rosaria Capobianchi

This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics