<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2017.01083</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>HIV Progression Depends on Codon and Amino Acid Usage Profile of Envelope Protein and Associated Host-Genetic Influence</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Roy</surname> <given-names>Ayan</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn003"><sup>&#x02020;</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Banerjee</surname> <given-names>Rachana</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn003"><sup>&#x02020;</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Basak</surname> <given-names>Surajit</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<xref ref-type="author-notes" rid="fn003"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/418939/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Botany, Bioinformatics Facility, University of North Bengal</institution> <country>Siliguri, India</country></aff>
<aff id="aff2"><sup>2</sup><institution>Structural Biology and Bio-Informatics Division, CSIR-Indian Institute of Chemical Biology</institution> <country>Kolkata, India</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Molecular Biology and Bioinformatics, Tripura University</institution> <country>Agartala, India</country></aff>
<aff id="aff4"><sup>4</sup><institution>Bioinformatics Centre, Tripura University</institution> <country>Agartala, India</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Hirofumi Akari, Kyoto University, Japan</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Yorifumi Satou, Kumamoto University, Japan; Shigeyoshi Harada, National Institute of Infectious Diseases, Japan</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Surajit Basak <email>basaksurajit&#x00040;gmail.com</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Virology, a section of the journal Frontiers in Microbiology</p></fn>
<fn fn-type="other" id="fn003"><p>&#x02020;These authors have contributed equally to this work.</p></fn></author-notes>
<pub-date pub-type="epub">
<day>15</day>
<month>06</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>8</volume>
<elocation-id>1083</elocation-id>
<history>
<date date-type="received">
<day>17</day>
<month>03</month>
<year>2017</year>
</date>
<date date-type="accepted">
<day>29</day>
<month>05</month>
<year>2017</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2017 Roy, Banerjee and Basak.</copyright-statement>
<copyright-year>2017</copyright-year>
<copyright-holder>Roy, Banerjee and Basak</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p>Acquired immune deficiency syndrome (AIDS) is a spectrum of conditions caused by infection with the human immunodeficiency virus (HIV). Two types of HIV have been characterized: HIV-1 and HIV-2. The present study investigated whether evolutionary selection pressure differs between rapid progressor (RP), slow progressor (SP), and long-term non-progressor (LTNP) of HIV-I infected individuals. An unexpected association between the evolutionary rate of substitution in envelope (env) gene and disease progression is observed. Our present study suggests that env genes of LTNP are subject to unusually strong functional constraint with respect to RP. We also observed that the three categories of env genes i.e., RP, SP, and LTNP, had their own characteristic pattern of amino acid usage and SP and LTNP sequences shared similar patterns of amino acid usage different from RP sequences and evolutionary rate significantly influenced the amino acid usage pattern of the three different types of env gene sequences. It was also noted that the evolutionary rate for the glycosylation sites of LTNP and SP sequences were even significantly less than the RP sequences. Comparative analysis on the influence of human host on the three categories of env genes are well correlated with the rates of disease progression suggesting the adaptive strategies of the viruses for successful residence and infection. Host associated selective constraints appeared most relaxed on the RP sequences and strongest in LTNP sequences. The present study clearly portrays how evolutionary selection pressure differs between three categories of env genes i.e., RP, SP, and LTNP. The env genes, coding for the env glycoproteins, experience severe selection constraints from the host due to their constant exposure to the host immune system. In this perspective it might be suggested that env gene evolution occurs mainly by negative selection with the occurrence of mutation that might not reach fixation in the viral population. This work also confers a deeper insight into the crucial effects of host factors that govern the overall progression of HIV infection.</p></abstract>
<kwd-group>
<kwd>rapid progressor</kwd>
<kwd>slow progressor</kwd>
<kwd>long-term non-progressor</kwd>
<kwd>evolutionary rate</kwd>
<kwd>disease progression</kwd>
</kwd-group>
<counts>
<fig-count count="4"/>
<table-count count="0"/>
<equation-count count="2"/>
<ref-count count="52"/>
<page-count count="8"/>
<word-count count="6260"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Human immunodeficiency viruses (HIVs) are one of the most crucial members of the retroviral family <italic>Retroviridae</italic>. Among the two main types of HIVs i.e., HIV-1 and HIV-2, HIV-1 has been characterized as the major causative agent of acquired immune deficiency syndrome (AIDS) (Blattner et al., <xref ref-type="bibr" rid="B3">1988</xref>; Weiss, <xref ref-type="bibr" rid="B45">1993</xref>).</p>
<p>Natural variability of HIV-1 is the key underlying the complex biology of the menacing virus. Evolutionary and epidemiological history of HIV can be investigated through phylogenetic analysis of erratic forms of the virus. Interestingly, due to rapid availability of nucleotide sequence data of viral genes, it is nowadays possible to illustrate a detailed phylogenetic relationship of viruses like HIV. Phylogenetic arrangements of HIV-1 are commonly carried out using nucleotide sequences of different sub-genomic regions of the same HIV-1 genome i.e., gag, pol, and envelope (env). This approach has already revealed unique inter-subtype recombinant forms of virus isolates (McCutchan, <xref ref-type="bibr" rid="B26">2006</xref>). Envelope (env) gene encodes known targets for cytotoxic T lymphocytes and neutralizing antibodies (Goudsmit et al., <xref ref-type="bibr" rid="B14">1988</xref>; Chesebro et al., <xref ref-type="bibr" rid="B9">1991</xref>). Despite extensive study of sequence variation of env gene of HIV-1, the exact impact of selection in governing the patterns of variation in the env gene still remains somewhat obscure. In some earlier studies, based on lower frequency of synonymous nucleotide substitutions within the V3 loop in the env genes, it has been circuitously verified that the variation in env gene is maintained by selection for antigenic diversity (Simmonds et al., <xref ref-type="bibr" rid="B42">1991</xref>; Bonhoeffer et al., <xref ref-type="bibr" rid="B4">1995</xref>). Theoretically, it has also been implicated that selection pressure and mutability dynamics are the vital forces that affect viral fitness and robustness (Brown, <xref ref-type="bibr" rid="B5">1997</xref>). It has been hypothesized that negative (purifying) selection constraints, operational in HIV-1, execute a crucial role in devising molecular evolutionary patterns of HIV-1 in contrast to positive (diversifying) selection which has been reported to play a minor role (Seo et al., <xref ref-type="bibr" rid="B38">2002</xref>; Drummond et al., <xref ref-type="bibr" rid="B11">2003</xref>).</p>
<p>Investigations pertaining to viral divergence in HIV-1 patients, associated with varying rates of progression of disease, have often produced interesting and conflicting results (Ganeshan et al., <xref ref-type="bibr" rid="B13">1997</xref>; Arts and Qui&#x000F1;ones-Mateu, <xref ref-type="bibr" rid="B1">2003</xref>; Rangel et al., <xref ref-type="bibr" rid="B33">2003</xref>). Some researchers have noted that non-synonymous substitutions in the env gene have been higher in long-term non-progressor (LTNP) whereas, there has been no difference between slow and normal progressors (NPs) regarding the influence of synonymous substitution (Strunnikova et al., <xref ref-type="bibr" rid="B43">1995</xref>; Bagnarelli et al., <xref ref-type="bibr" rid="B2">1999</xref>). However, Markham and co-workers observed opposite results with greater accumulation of non-synonymous substitutions in NPs (Markham et al., <xref ref-type="bibr" rid="B25">1998</xref>). The event of T-cell activation can be effectively employed to estimate the progression to AIDS in HIV-1 infected individuals and might be utility in apt assessment of viral replication rates and evolutionary complexities. Consequently, activation of immune responses might enforce critical constraints on the evolutionary tricks and viral generation rates of HIV. However, such effects yet remain unexplored from the evolutionary perspective. Extensive investigations pertaining to evolutionary signatures might prove handy in elucidating the patterns of molecular adaptation of HIV in human host. Variations in synonymous substitution rates reflect changes in generation time or mutation rate, while rate of non-synonymous mutations tend to be affected by changes in selective pressure and effective population size (Lemey et al., <xref ref-type="bibr" rid="B23">2007</xref>).</p>
<p>Envelope (env) gene is the fastest evolving one in the HIV-1 genome. Conflicting selective pressures shape the evolutionary dynamics of the virus (Korber et al., <xref ref-type="bibr" rid="B19">2000</xref>; Ross and Rodrigo, <xref ref-type="bibr" rid="B35">2002</xref>; Yoshida et al., <xref ref-type="bibr" rid="B50">2011</xref>). In the present study, investigations pertaining to the evolutionary traits of the env genes representing rapid progressor (RP), slow progressor (SP), and LTNP human patients (available in public domain), have been performed extensively through detailed phylogenetic analysis, followed by subsequent estimation of synonymous and non-synonymous substitution rates. Rapid-progressors are associated with speedy development of AIDS within 3 years of infection whereas; slow-progressors are characterized by a comparatively slower gradual onset of AIDS after seroconversion that might take a span 3&#x02013;10 years (Kumar, <xref ref-type="bibr" rid="B20">2013</xref>) to develop. Comparative analysis of relative synonymous codon usage (RSCU) patterns of the three categories of env genes marked by varying degrees of disease progression i.e., RP, SP, and LTNP types and their similarity index with human host has also been executed in the present endeavor with a motif to address the adaptive strategies of the viruses for successful residence and infection. Results pertaining to RSCU patterns and evolutionary dynamics might prove utility to unravel the molecular underpinnings of viral adaptation and infection in human host and excavate subtle discrepancies among the viral types associated with varying rates of disease progression.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Sequence retrieval</title>
<p>All available env gene sequences of HIV-1 subtype B were retrieved from HIV Database (<ext-link ext-link-type="uri" xlink:href="http://www.hiv.lanl.gov/">http://www.hiv.lanl.gov/</ext-link>) representing three types of patient categories i.e., RP, SP, and LTNP patients. Redundant and erroneous sequences (sequences with internal stop codons) were removed to avoid stochastic variations and sampling errors (Wright, <xref ref-type="bibr" rid="B47">1990</xref>). A comprehensive set of 264 coding sequences constituted final dataset for our analysis (Supplementary Material, Data Sheet <xref ref-type="supplementary-material" rid="SM2">1</xref>).</p>
</sec>
<sec>
<title>Phylogenetic analysis</title>
<p>Phylogenetic analysis provides the ancestral relationship of a set of sequences. It involves the construction of a tree, where the nodes indicate separate evolutionary paths, and the lengths of the branches give an estimate of how distantly related the sequences represented by those branches are. In the present study, all the env genes were aligned using the Clustal Omega program (<ext-link ext-link-type="uri" xlink:href="https://www.ebi.ac.uk/Tools/msa/clustalo/">https://www.ebi.ac.uk/Tools/msa/clustalo/</ext-link>). The resultant multiple sequence alignments were subsequently used to construct the neighbor-joining method based phylogenetic tree with 500 bootstrap replicates. MEGA 7 was used for phylogenetic analysis (Kumar et al., <xref ref-type="bibr" rid="B21">2016</xref>). Here, we have constructed an unrooted tree, where, the distances and relationships between the taxa have been plotted without making any assumption concerning their descent.</p>
</sec>
<sec>
<title>Estimation of relative synonymous codon usage</title>
<p>Relative synonymous codon usage (RSCU) (Sharp et al., <xref ref-type="bibr" rid="B41">1986</xref>) is calculated as the ratio of the observed frequency of a codon to the expected frequency if codon usage was uniform within a synonymous codon group.</p>
<p>Many genes display a non-random usage of synonymous codons for specific amino acids. A measure of the extent of this non-randomness is given by the RSCU (Sharp and Li, <xref ref-type="bibr" rid="B40">1986</xref>). It is the ratio of observed frequency of the codons with respect to the expected frequency of the same codon if codon usage was uniform within a synonymous codon group.</p>
<p>RSCU is calculated as:</p>
<disp-formula id="E1"><mml:math id="M1"><mml:mrow><mml:mi>R</mml:mi><mml:mi>S</mml:mi><mml:mi>C</mml:mi><mml:mi>U</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>F</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>q</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mi>c</mml:mi><mml:mi>y</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mi>o</mml:mi><mml:mi>f</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>d</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mtable columnalign='left'><mml:mtr columnalign='left'><mml:mtd columnalign='left'><mml:mrow><mml:mi>E</mml:mi><mml:mi>x</mml:mi><mml:mi>p</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>t</mml:mi><mml:mi>e</mml:mi><mml:mi>d</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mi>f</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>q</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mi>c</mml:mi><mml:mi>y</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mi>o</mml:mi><mml:mi>f</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>d</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:mtd></mml:mtr><mml:mtr columnalign='left'><mml:mtd columnalign='left'><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>d</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mi>u</mml:mi><mml:mi>s</mml:mi><mml:mi>a</mml:mi><mml:mi>g</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mi>w</mml:mi><mml:mi>a</mml:mi><mml:mi>s</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mi>u</mml:mi><mml:mi>n</mml:mi><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mi>m</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula>
<p>Relative synonymous codon usage (RSCU) values &#x0003E; 1 indicate that the observed frequency of synonymous codons is more compared to the expected frequency and lower than one indicates the opposite (dos Reis et al., <xref ref-type="bibr" rid="B10">2003</xref>).</p>
<p>RSCU values of the 59 codons [excluding the single synonymous codons AUG (Met) and UGG (Trp) and the three termination codons] of RP, SP, and LTNP env gene sequences were calculated using CodonW (Ver. 1.4.2) software (<ext-link ext-link-type="uri" xlink:href="http://www.molbiol.ox.ac.uk/cu">http://www.molbiol.ox.ac.uk/cu</ext-link>) (Peden, <xref ref-type="bibr" rid="B32">2000</xref>). Codon usage frequencies of human host (<italic>Homo sapiens</italic>) were obtained from the Codon Usage Database (<ext-link ext-link-type="uri" xlink:href="http://www.kazusa.or.jp/codon/">http://www.kazusa.or.jp/codon/</ext-link>) (Nakamura et al., <xref ref-type="bibr" rid="B29">2000</xref>).</p>
</sec>
<sec>
<title>Assessment of similarity index</title>
<p>Viral genomes are relatively smaller in size and largely rely on the host to execute crucial biological activities like replication, protein synthesis and transmission (Nasrullah et al., <xref ref-type="bibr" rid="B30">2015</xref>). In this pretext, it has been suggested that viral robustness, survival and evasion of host&#x00027;s immune signals and responses largely depend on the interplay of codon usage patterns of the concerned virus and its respective host (Shackelton et al., <xref ref-type="bibr" rid="B39">2006</xref>; Moratorio et al., <xref ref-type="bibr" rid="B28">2013</xref>). Here, we have considered the similarity index to understand the influence of host genome on the adaptability of virus genome inside the host. The influence of the overall codon usage pattern of the host on the formation of the overall codon usage of the virus is defined as the similarity index.</p>
<p>Relative synonymous codon usage (RSCU) values of the three different types of env gene sequences i.e., RP, SP, and LTNP were compared with that of human host in order to assess the influence of human host system in shaping the patterns of codon usage among the env types. The parameter similarity index, <italic>D(A,B)</italic> (Nasrullah et al., <xref ref-type="bibr" rid="B30">2015</xref>) is computed as follows:</p>
<disp-formula id="E2"><mml:math id="M2"><mml:mtable columnalign='left'><mml:mtr><mml:mtd><mml:mi>R</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>A</mml:mi><mml:mo>,</mml:mo><mml:mi>B</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mtext>&#x0200A;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x0200A;</mml:mtext><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>59</mml:mn></mml:mrow></mml:munderover><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mstyle><mml:mi>X</mml:mi><mml:msub><mml:mi>b</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:msqrt><mml:mrow><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mtext>&#x0200A;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x0200A;</mml:mtext><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mtext>&#x0200A;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x0200A;</mml:mtext><mml:mn>59</mml:mn></mml:mrow></mml:munderover><mml:mrow><mml:msubsup><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:msubsup><mml:mi>X</mml:mi><mml:mstyle displaystyle='true'><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mtext>&#x0200A;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x0200A;</mml:mtext><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mtext>&#x0200A;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x0200A;</mml:mtext><mml:mn>59</mml:mn></mml:mrow></mml:munderover><mml:mrow><mml:msubsup><mml:mi>b</mml:mi><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mstyle></mml:mrow></mml:mstyle></mml:mrow></mml:msqrt></mml:mrow></mml:mfrac></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>D</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>A</mml:mi><mml:mo>,</mml:mo><mml:mi>B</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x02212;</mml:mo><mml:mi>R</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>A</mml:mi><mml:mo>,</mml:mo><mml:mi>B</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow><mml:mn>2</mml:mn></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>R(A,B)</italic> refers to the cosine value of an included angle between A and B spatial vectors and represents the similarity between particular env type of HIV and overall codon usage pattern of human host. <italic>a</italic><sub><italic>i</italic></sub> signifies the RSCU value for a particular codon among the pool of 59 codons for every specific type of env (i.e., RP, SP, and LTNP) protein coding sequence. <italic>b</italic><sub><italic>i</italic></sub> indicates the RSCU value for the same codon in case of human host. <italic>D(A,B)</italic> signifies the probable impact of human codon usage patterns on the concerned env types of HIV. The value of similarity index, <italic>D(A,B)</italic>, has been reported to lie between 0 to 1.0 (Zhou et al., <xref ref-type="bibr" rid="B52">2013</xref>).</p>
</sec>
<sec>
<title>Multivariate analyses on amino acid usage</title>
<p>Correspondence analysis (COA) (Peden, <xref ref-type="bibr" rid="B32">2000</xref>; <ext-link ext-link-type="uri" xlink:href="http://www.molbiol.ox.ac.uk/cu">http://www.molbiol.ox.ac.uk/cu</ext-link>) was used to investigate the major trend in amino acid usage variation among the env genes. Since amino acid usage by its very nature is multivariate, it is necessary to analyse this data with multivariate statistical techniques i.e., COA. Correspondence analysis (COA) is an ordination technique that identifies the major trends in the variation of the data and distributes genes along continuous axes in accordance with these trends. It has the advantage of not to make any assumption that the data falls into discrete clusters and therefore represent continuous variation accurately. Correspondence analysis (COA) on relative amino acid usage (RAAU) of env gene sequences was executed employing the CodonW program.</p>
</sec>
<sec>
<title>Evolutionary rate calculation</title>
<p>The ratio (&#x003C9;) of rate of non-synonymous substitutions per non-synonymous site (Ka) to rate of synonymous substitutions per synonymous site (Ks) indicates the impact of evolution on a gene segment. &#x003C9; &#x0003E; 1 indicates diversifying (positive) selection whereas, &#x003C9; &#x0003C; 1 signifies purifying (negative) selection (Roy et al., <xref ref-type="bibr" rid="B36">2015</xref>). The evolutionary rates of the orthologous env genes representing all three types under analysis i.e., RP, SP, and LTNP (with reference to progressors) were calculated using Codeml program included in the PAML software package (ver. 4.5) (Nei and Gojobori, <xref ref-type="bibr" rid="B31">1986</xref>; Yang, <xref ref-type="bibr" rid="B49">2007</xref>) (<ext-link ext-link-type="uri" xlink:href="http://abacus.gene.ucl.ac.uk/software/paml.html">http://abacus.gene.ucl.ac.uk/software/paml.html</ext-link>) with runmode &#x0003D; &#x02212;2 and CodonFreq &#x0003D; 1.</p>
<p>Evolutionary rate of each individual residue for a given env gene sequence was calculated using SWAKK server (<ext-link ext-link-type="uri" xlink:href="http://ibl.mdanderson.org/swakk/">http://ibl.mdanderson.org/swakk/</ext-link>) (Liang et al., <xref ref-type="bibr" rid="B24">2006</xref>). It estimates the ratio of non-synonymous to synonymous substitution rates (Ka/Ks) between a pair of protein-coding DNA sequences, by a sliding 3D window analysis.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Phylogenetic profiling of various categories of env genes</title>
<p>Phylogenetic analysis was performed to investigate the evolutionary relationship of 264 env gene sequences of HIV-1 subtype B, representing three different categories of patients (RP, SP, and LTNP). The phylogenetic tree revealed three major lineages where mostly three categories of env gene sequences i.e., RP, SP, and LTNP have been clustered (Figure <xref ref-type="fig" rid="F1">1</xref>).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Neighbor-joining method based phylogenetic tree of the env genes (LTNP, SP, and RP). Red colored lines represent Long term non-progressor (LTNP) sequences. Blue colored lines refer to the Rapid progressor (RP) sets. Green colored lines depict the Slow progressor (SP) sequences.</p></caption>
<graphic xlink:href="fmicb-08-01083-g0001.tif"/>
</fig>
</sec>
<sec>
<title>Trends in amino acid usage of env genes representing varying rates of disease progression</title>
<p>Correlating the observed phylogenetic profile of env genes with RAAU pattern appeared necessary in order to efficiently explore the impact of the various categories of env genes on gene sequence diversity. Accordingly, CoA on amino acid usage was performed using 264 env gene sequences representing three different categories of disease progression genes of HIV-1 subtype B. COA generated two separate clusters (Figure <xref ref-type="fig" rid="F2">2</xref>). Figure <xref ref-type="fig" rid="F2">2</xref> shows the distribution of the genes along two major axes of amino acid usage variation.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Distribution of env genes along the two major axes of correspondence analysis (COA) based on RAAU data. x-axis- Axis 1 of RAAU; y-axis- Axis 2 of RAAU. Red colored square boxes represent Long term non-progressor (LTNP) sequences. Blue colored square boxes depict Rapid progressor (RP) sequences. Green colored square boxes refer to Slow progressor (SP) sequences.</p></caption>
<graphic xlink:href="fmicb-08-01083-g0002.tif"/>
</fig>
<p>It was perceptible from the amino acid usage based COA that SP and LTNP sequences shared similar patterns of amino acid usage different from RP sequences which formed a discrete cluster along Axis 1 (Figure <xref ref-type="fig" rid="F2">2</xref>). It was apparent from Figure <xref ref-type="fig" rid="F2">2</xref> that the three categories of env genes i.e., RP, SP, and LTNP, had their own characteristic pattern of amino acid usage and the env genes of SP exhibited greater resemblance with env genes of LTNP in terms of amino acid usage pattern. It was also clear that the results of COA were in absolute accordance with phylogenetic profile (Figure <xref ref-type="fig" rid="F1">1</xref>).</p>
</sec>
<sec>
<title>Impact of evolutionary selection pressure on three categories of env genes</title>
<p>Earlier, Canducci et al. (<xref ref-type="bibr" rid="B7">2009</xref>) observed that evolutionary rate of HIV-1 env gene varies between NP and LTNP patients. The evolutionary rate (&#x003C9;) for the env sequences of RP, SP, and LTNP type was found to correlate significantly with Axis 1 (<italic>R</italic> &#x0003D; &#x02212;0.31, <italic>P</italic> &#x0003C; 0.01) of RAAU data. It indicates that evolutionary selection pressure significantly influenced the amino acid usage pattern of the three different types of env gene sequences.</p>
<p>It was evident from COA of amino acid usage that env gene sequences belonging to the category SP and LTNP segregated from RP along Axis 1 of RAAU. Significant negative correlation of &#x003C9; (Ka/Ks) with Axis 1 of RAAU indicated that the average Ka/Ks values of SP and LTNP sequences would be lesser than the average Ka/Ks value of RP sequences. It was indeed found that the average Ka/Ks value of LTNP sequences (0.48) was significantly less than that of average Ka/Ks value of RP sequences (0.58) (<italic>P</italic> &#x0003C; 0.01). We also observed that the average Ka/Ks value of SP sequences (0.49) was significantly less than that of RP sequences (0.58) (<italic>P</italic> &#x0003C; 0.01). But, the difference between the average Ka/Ks values of LTNP and SP sequences was observed to be statistically insignificant. Considering the fact that evolutionary rate might vary depending on the functional constraints, SWAKK web server (<ext-link ext-link-type="uri" xlink:href="http://ibl.mdanderson.org/swakk/">http://ibl.mdanderson.org/swakk/</ext-link>) was employed to calculate the evolutionary rate for each single amino acid residue in the concerned protein sequences. Extensive glycosylation of HIV-1 envelope proteins (env) is known to play an important role in evasion of host immune response. Average Ka/Ks value for the glycosylation sites of RP sequences were compared with that of the average Ka/Ks values of the LTNP and SP sequences. Interestingly, it was noted that the average Ka/Ks values for the glycosylation sites of LTNP (0.37) and SP (0.48) sequences were significantly less than that of the average Ka/Ks value of RP sequences (0.62) (<italic>P</italic> &#x0003C; 0.01). It was also noteworthy that the average Ka/Ks value of glycosylation sites for LTNP sequences was significantly less than that of SP sequences (<italic>P</italic> &#x0003C; 0.01; Figure <xref ref-type="fig" rid="F3">3</xref>). Such an observation clearly indicates the functional importance and vitality of glycosylation sites in case of HIV infection.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>Variation of Ka/Ks of env genes (LTNP, SP and RP) and respective glycosylation sites. LTNP, Long term non-progressor; SP, Slow progressor; RP, Rapid progressor. The blue line refers to the plot of the average Ka/Ks values of the full env sequences of LTNP, SP, and RP. The red line represents the plot of the average Ka/Ks values of the associated glycosylation sites of LTNP, SP, and RP.</p></caption>
<graphic xlink:href="fmicb-08-01083-g0003.tif"/>
</fig>
</sec>
<sec>
<title>Influence of host machinery on the disease progression of HIV based on RSCU</title>
<p>In the present study we have considered the similarity index parameter (Materials and Methods) to understand the influence of host genome on the adaptability of virus genome inside the host. Methodical inspection of similarity index of the three concerned categories of env gene sequences with human host revealed that the selection pressure due to human host was more severe on the LTNP sequences in comparison to the SP and RP sequence sets (Figure <xref ref-type="fig" rid="F4">4</xref>). Host associated selective constraints appeared most relaxed on the RP sequences as has been evident from Figure <xref ref-type="fig" rid="F4">4</xref>. Difference between similarity index values of the respective LTNP, SP, and RP sequences appeared to be statistically significant and the pattern of variation among them seemed distinct enough to infer that the LTNP sequences were under a stronger selective impact of human host.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>Similarity index [D(A,B)] of env genes (LTNP, SP, and RP) with respect to human host. Orange bar refers to similarity index value [D(A,B)] of Long term non-progressor (LTNP) sequences. Blue bar represents the similarity index value [D(A,B)] of the Slow progressor (SP) sets. Green bar depicts the similarity index value [D(A,B)] of the Rapid progressor (RP) sequences.</p></caption>
<graphic xlink:href="fmicb-08-01083-g0004.tif"/>
</fig>
<p>Apart from estimating similarity index, we extended our analysis pertaining to the usage of identical codons among the three diverse types of env gene sequences with human host. Codons were defined as over-represented (RSCU &#x0003E; 1.6) and under-represented (RSCU &#x0003C; 0.6) as per scheme followed by Wong et al. (<xref ref-type="bibr" rid="B46">2010</xref>). Similar codon usage pattern was inferred when a particular codon was found to display RSCU values &#x0003C; 0.6 or more than 1.6 or found to fall within a range of 0.6 to 1.6 for both human host and the respective env type i.e., RP, SP, and LTNP. Interestingly, it was evident after careful inspection that LTNP sequences shared the highest frequency of similarly selected codons (36 codons out of 59 codons) with human host system in comparison to SP (34 codons out of 59 codons) and RP (32 codons out of 59 codons) types (Supplementary Table <xref ref-type="supplementary-material" rid="SM1">1</xref>). Occurrence of higher frequencies of identically shared codons with human host in LTNP might eventually lead to a better adaptability and longer abode of the concerned LTNP in human cellular environment.</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>Intragenomic and intergenomic variations pertaining to codon and amino acid usage patterns can be well explained in the light of multivariate statistical analysis. Correspondence analysis (CoA) is one such type of comprehensive statistical tool which highlights the major variations among codon and amino acid usage data and places them in accordance with such observed variations (Greenacre, <xref ref-type="bibr" rid="B15">1984</xref>). CoA on the basis of RAAU of the three concerned types of env gene sequences revealed their unique amino acid compositional features and separated them according to their diverse compositional traits along the two principle axes of separation of genes i.e., Axis 1 and Axis 2 of RAAU data (Figure <xref ref-type="fig" rid="F2">2</xref>). SP and LTNP sequences, to certain extent, had identical amino acid usage features as was reflected from COA on RAAU data (Figure <xref ref-type="fig" rid="F2">2</xref>). However, RP sequences clustered separately along Axis 1 signifying completely diverse amino acid usage patterns from SP and LTNP sequences. Such contrasting features of amino acid composition of RP sequences might have an implication in their rapid rate of disease progression in human host, characterized by a short stay in host cellular environment and speedy proliferation and infection (Kumar, <xref ref-type="bibr" rid="B20">2013</xref>; Jarrin et al., <xref ref-type="bibr" rid="B17">2015</xref>).</p>
<p>Earlier reports suggested that the env proteins of LTNP were subject to purifying selection for their overall mutability and tendencies of variation in LTNP. The impact of stronger purifying selection on env genes of LTNP in contrast to RP might be an outcome of the survival value conferred by adaptive stability of the envelop protein. To further validate our observation, we then fitted linear regression lines between Ka and Ks through the origin assuming that Ka and Ks should both initially be zero at the moment of lineage divergence. The slopes for the lines of whole env genes for LTNP and RP were found to be 0.48 and 0.58, respectively. These results indicated that the increase in Ka with respect to Ks in RP was around 1.2 times faster than in LTNP. Such observations were suggestive of the fact that env proteins of LTNP genes have been subject to more severe evolutionary selection with respect to RP, which eventually has been reflected at a lower level of non-synonymous nucleotide substitution for a given level of synonymous substitution. Similarly, the slope for SP was found to be 0.49, suggesting that the increase in Ka with respect to Ks in SP was around 1.01 times faster than in LTNP. This, in turn, reinforced the hypothesis that unusual functional constraints have been instrumental on the env protein sequences of LTNP. Higher average Ka/Ks value of RP over SP and LTNP sequences (statistically significant) indicated that the evolutionary constraint on the RP sequences has been lesser compared to the impact of evolutionary forces on the SP and LTNP sequences. Such an instance of relaxed selection pressure might confer an added advantage to the RP sequences, associated with rapid infection in a short span of time, in accumulation of non-synonymous mutations and subsequent evasion of host (human) immune response (Khanlou et al., <xref ref-type="bibr" rid="B18">1996</xref>; Jarrin et al., <xref ref-type="bibr" rid="B17">2015</xref>).</p>
<p>HIV-1 shows variation in susceptibility of infection to human (Fellay et al., <xref ref-type="bibr" rid="B12">2007</xref>). Many attempts have been made to understand the viral genetics associated with disease progression in HIV-1 affected patients. However, molecular mechanism and host genetic influence on effective viral pathogenesis and subsequent progression of disease still remain unclear (Michael, <xref ref-type="bibr" rid="B27">1999</xref>; Carrington et al., <xref ref-type="bibr" rid="B8">2001</xref>).</p>
<p>Viruses mimic the codon usage profile of their respective hosts and rely largely on the chaperon apparatus of their hosts for proficient replication and enhanced robustness (Kunec and Osterrieder, <xref ref-type="bibr" rid="B22">2016</xref>). In this pretext, similarity index of a concerned virus with its respective host has been suggested to be an effective tool to properly assess the impact of host genomic influence on the patterns of associated viral codonic signatures. Several earlier studies have used similarity index to demonstrate the influence of host genome over the viral genome (Nasrullah et al., <xref ref-type="bibr" rid="B30">2015</xref>; Butt et al., <xref ref-type="bibr" rid="B6">2016</xref>). Estimation of similarity index with human host among the concerned types of env sequences associated with varying degrees of disease progression i.e., RP, SP, and LTNP signified the fact that the impact of selection by human host was most intense in LTNP compared to SP and RP. Such observations seemed logical in light of the fact that LTNP sequences reside in the human system for long and are characterized by high CD4&#x0002B; and CD8&#x0002B; T-cell counts (Zeller et al., <xref ref-type="bibr" rid="B51">1996</xref>; Kumar, <xref ref-type="bibr" rid="B20">2013</xref>). Thus, higher similarity index with human host in LTNP and usage of a higher proportion of identically selected codons with human host tend to be a consequence of prolonged residence and adaptive efficacy of LTNP in human body. In contrast, the instance of lower similarity index in RP with human host might be well explained from the viewpoint that RP stays in human body for a shorter span and causes rapid infection (Khanlou et al., <xref ref-type="bibr" rid="B18">1996</xref>). However, SP sequences displayed higher similarity index with human host in comparison to RP which might be attributed to a comparatively longer residence and slower disease progression than RP (Hogervorst et al., <xref ref-type="bibr" rid="B16">1995</xref>). Thus, observations pertaining to similarity index among the three types of env sequences with human host seemed well correlated with the rates of disease progression and duration of abode in human host.</p>
</sec>
<sec sec-type="conclusions" id="s5">
<title>Conclusion</title>
<p>Decreased HIV-1 strain evolution correlates with weaker viral fitness and the inability to evade the host immune system. Some studies have shown that viral strains from LTNPs are less evolved and thus, less capable of evading the host immunological response when compared with progressor strains (Wang et al., <xref ref-type="bibr" rid="B44">2003</xref>; Sandon&#x000ED;s et al., <xref ref-type="bibr" rid="B37">2009</xref>). In our study, Ka/Ks decreased significantly for LTNP env gene sequences compared to RP gene sequences. Similar trend was observed for glycosylation sites of the respective sequences. Findings of the present study confirm that HIV-1 env gene of LTNP have been under stronger purifying selection compared to the respective RP counterpart. The env genes, coding for the env glycoproteins, experience severe selection constraints from the host due to their constant exposure to the host immune system. The env protein mediates HIV-1 entry into host CD4&#x0002B; T cells. Reduction in Ka/Ks for env genes of LTNP might result in significantly slower infection of CD4&#x0002B; T cells compared with RP. Our hypothesis seems well justified in light of the fact that the env protein plays essential role in receptor binding and cell entry (Wyatt and Sodroski, <xref ref-type="bibr" rid="B48">1998</xref>) and efficiently maintains extensive glycosylation on its surface to avoid antibody neutralization (Reitter et al., <xref ref-type="bibr" rid="B34">1998</xref>). In this perspective, it might be suggested that env gene evolution occurs mainly by negative selection with the occurrence of mutation that might not reach fixation in the viral population.</p>
<p>Present study addresses an evolutionary perspective that recognizes the ongoing dynamic interplay between human host and genetic diversity of HIV. Our analysis also provides an elucidation of the driving force accountable for the selection of favorable host genetic profiles as well as for viral mutations that are best adapted to the concerned host effects. Outcomes of the current research work confer a vivid portrait about the role of host factors in determining the overall progression of HIV infection.</p>
</sec>
<sec id="s6">
<title>Author contributions</title>
<p>SB planned and designed the work. AR and RB generated the data. SB, AR, and RB analyzed data and prepared the manuscript. All authors read and approved the final manuscript.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</sec>
</body>
<back>
<ack>
<p>Authors thankfully acknowledge the Department of Biotechnology, Government of India (BT/BI/12/055/2012), for their computational facility at Bioinformatics Centre, Tripura University.</p>
</ack>
<sec sec-type="supplementary-material" id="s7">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://journal.frontiersin.org/article/10.3389/fmicb.2017.01083/full#supplementary-material">http://journal.frontiersin.org/article/10.3389/fmicb.2017.01083/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table1.DOC" id="SM1" mimetype="application/msword" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet1.DOC" id="SM2" mimetype="application/msword" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arts</surname> <given-names>E. J.</given-names></name> <name><surname>Qui&#x000F1;ones-Mateu</surname> <given-names>M. E.</given-names></name></person-group> (<year>2003</year>). <article-title>Sorting out the complexities of HIV-1 fitness</article-title>. <source>AIDS</source> <volume>17</volume>, <fpage>780</fpage>&#x02013;<lpage>781</lpage>. <pub-id pub-id-type="doi">10.1097/01.aids.0000050881.72891.32</pub-id><pub-id pub-id-type="pmid">12646811</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bagnarelli</surname> <given-names>P.</given-names></name> <name><surname>Mazzola</surname> <given-names>F.</given-names></name> <name><surname>Menzo</surname> <given-names>S.</given-names></name> <name><surname>Montroni</surname> <given-names>M.</given-names></name> <name><surname>Butini</surname> <given-names>L.</given-names></name> <name><surname>Clementi</surname> <given-names>M.</given-names></name></person-group> (<year>1999</year>). <article-title>Host-specific modulation of the selective constraints driving human immunodeficiency virus type 1env gene evolution</article-title>. <source>J. Virol.</source> <volume>73</volume>, <fpage>3764</fpage>&#x02013;<lpage>3777</lpage>. <pub-id pub-id-type="pmid">10196271</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blattner</surname> <given-names>W.</given-names></name> <name><surname>Gallo</surname> <given-names>R. C.</given-names></name> <name><surname>Temin</surname> <given-names>H. M.</given-names></name></person-group> (<year>1988</year>). <article-title>Hiv causes AIDS</article-title>. <source>Science</source> <volume>241</volume>, <fpage>515</fpage>&#x02013;<lpage>516</lpage>. <pub-id pub-id-type="pmid">3399881</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bonhoeffer</surname> <given-names>S.</given-names></name> <name><surname>Holmes</surname> <given-names>E. C.</given-names></name> <name><surname>Nowak</surname> <given-names>M. A.</given-names></name></person-group> (<year>1995</year>). <article-title>Causes of HIV diversity</article-title>. <source>Nature</source> <volume>376</volume>, <fpage>125</fpage>&#x02013;<lpage>125</lpage>. <pub-id pub-id-type="pmid">7603560</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>A. J. L.</given-names></name></person-group> (<year>1997</year>). <article-title>Analysis of HIV-1 env gene sequences reveals evidence for a low effective number in the viral population</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>94</volume>, <fpage>1862</fpage>&#x02013;<lpage>1865</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.94.5.1862</pub-id><pub-id pub-id-type="pmid">9050870</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Butt</surname> <given-names>A. M.</given-names></name> <name><surname>Nasrullah</surname> <given-names>I.</given-names></name> <name><surname>Qamar</surname> <given-names>R.</given-names></name> <name><surname>Tong</surname> <given-names>Y.</given-names></name></person-group> (<year>2016</year>). <article-title>Evolution of codon usage in Zika virus genomes is host and vector specific</article-title>. <source>Emerg. Microbes Infect.</source> <volume>5</volume>:<fpage>e107</fpage>. <pub-id pub-id-type="doi">10.1038/emi.2016.106</pub-id><pub-id pub-id-type="pmid">27729643</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Canducci</surname> <given-names>F.</given-names></name> <name><surname>Marinozzi</surname> <given-names>M. C.</given-names></name> <name><surname>Sampaolo</surname> <given-names>M.</given-names></name> <name><surname>Berr&#x000E8;</surname> <given-names>S.</given-names></name> <name><surname>Bagnarelli</surname> <given-names>P.</given-names></name> <name><surname>Degano</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Dynamic features of the selective pressure on the human immunodeficiency virus type 1 (HIV-1) gp120 CD4-binding site in a group of long term non-progressor (LTNP) subjects</article-title>. <source>Retrovirology</source> <volume>6</volume>:<fpage>1</fpage>. <pub-id pub-id-type="doi">10.1186/1742-4690-6-4</pub-id><pub-id pub-id-type="pmid">19146663</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carrington</surname> <given-names>M.</given-names></name> <name><surname>Nelson</surname> <given-names>G.</given-names></name> <name><surname>O&#x00027;Brien</surname> <given-names>S. J.</given-names></name></person-group> (<year>2001</year>). <article-title>Considering genetic profiles in functional studies of immune responsiveness to HIV-1</article-title>. <source>Immunol. Lett.</source> <volume>79</volume>, <fpage>131</fpage>&#x02013;<lpage>140</lpage>. <pub-id pub-id-type="doi">10.1016/S0165-2478(01)00275-9</pub-id><pub-id pub-id-type="pmid">11595300</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chesebro</surname> <given-names>B.</given-names></name> <name><surname>Nishio</surname> <given-names>J.</given-names></name> <name><surname>Perryman</surname> <given-names>S.</given-names></name> <name><surname>Cann</surname> <given-names>A.</given-names></name> <name><surname>O&#x00027;Brien</surname> <given-names>W.</given-names></name> <name><surname>Chen</surname> <given-names>I. S.</given-names></name> <etal/></person-group>. (<year>1991</year>). <article-title>Identification of human immunodeficiency virus envelope gene sequences influencing viral entry into CD4-positive HeLa cells, T-leukemia cells, and macrophages</article-title>. <source>J. Virol.</source> <volume>65</volume>, <fpage>5782</fpage>&#x02013;<lpage>5789</lpage>. <pub-id pub-id-type="pmid">1920616</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>dos Reis</surname> <given-names>M.</given-names></name> <name><surname>Wernisch</surname> <given-names>L.</given-names></name> <name><surname>Savva</surname> <given-names>R.</given-names></name></person-group> (<year>2003</year>). <article-title>Unexpected correlations between gene expression and codon usage bias from microarray data for the whole Escherichia coli K-12 genome</article-title>. <source>Nucleic Acids Res.</source> <volume>31</volume>, <fpage>6976</fpage>&#x02013;<lpage>6985</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkg897</pub-id><pub-id pub-id-type="pmid">14627830</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Drummond</surname> <given-names>A.</given-names></name> <name><surname>Oliver</surname> <given-names>G.</given-names></name> <name><surname>Rambaut</surname> <given-names>A.</given-names></name></person-group> (<year>2003</year>). <article-title>Inference of viral evolutionary rates from molecular sequences</article-title>. <source>Adv. Parasitol.</source> <volume>54</volume>, <fpage>331</fpage>&#x02013;<lpage>358</lpage>. <pub-id pub-id-type="doi">10.1016/S0065-308X(03)54008-8</pub-id><pub-id pub-id-type="pmid">14711090</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fellay</surname> <given-names>J.</given-names></name> <name><surname>Shianna</surname> <given-names>K. V.</given-names></name> <name><surname>Ge</surname> <given-names>D.</given-names></name> <name><surname>Colombo</surname> <given-names>S.</given-names></name> <name><surname>Ledergerber</surname> <given-names>B.</given-names></name> <name><surname>Weale</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>A whole-genome association study of major determinants for host control of HIV-1</article-title>. <source>Science</source> <volume>317</volume>, <fpage>944</fpage>&#x02013;<lpage>947</lpage>. <pub-id pub-id-type="doi">10.1126/science.1143767</pub-id><pub-id pub-id-type="pmid">17641165</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ganeshan</surname> <given-names>S.</given-names></name> <name><surname>Dickover</surname> <given-names>R. E.</given-names></name> <name><surname>Korber</surname> <given-names>B. T.</given-names></name> <name><surname>Bryson</surname> <given-names>Y. J.</given-names></name> <name><surname>Wolinsky</surname> <given-names>S. M.</given-names></name></person-group> (<year>1997</year>). <article-title>Human immunodeficiency virus type 1 genetic evolution in children with different rates of development of disease</article-title>. <source>J. Virol.</source> <volume>71</volume>, <fpage>663</fpage>&#x02013;<lpage>677</lpage>. <pub-id pub-id-type="pmid">8985398</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goudsmit</surname> <given-names>J.</given-names></name> <name><surname>Debouck</surname> <given-names>C.</given-names></name> <name><surname>Meloen</surname> <given-names>R. H.</given-names></name> <name><surname>Smit</surname> <given-names>L.</given-names></name> <name><surname>Bakker</surname> <given-names>M.</given-names></name> <name><surname>Asher</surname> <given-names>D. M.</given-names></name> <etal/></person-group>. (<year>1988</year>). <article-title>Human immunodeficiency virus type 1 neutralization epitope with conserved architecture elicits early type-specific antibodies in experimentally infected chimpanzees</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>85</volume>, <fpage>4478</fpage>&#x02013;<lpage>4482</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.85.12.4478</pub-id><pub-id pub-id-type="pmid">2454471</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Greenacre</surname> <given-names>M. J.</given-names></name></person-group> (<year>1984</year>). <source>Theory and Applications of Correspondence Analysis</source>. <publisher-loc>London</publisher-loc>: <publisher-name>Academic Press</publisher-name>.</citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hogervorst</surname> <given-names>E.</given-names></name> <name><surname>Jurriaans</surname> <given-names>S.</given-names></name> <name><surname>de Wolf</surname> <given-names>F.</given-names></name> <name><surname>van Wijk</surname> <given-names>A.</given-names></name> <name><surname>Wiersma</surname> <given-names>A.</given-names></name> <name><surname>Valk</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>1995</year>). <article-title>Predictors for non-and slow progression in human immunodeficiency virus (HIV) type 1 infection: low viral RNA copy numbers in serum and maintenance of high HIV-1 p24-specific but not V3-specific antibody levels</article-title>. <source>J. Infect. Dis.</source> <volume>171</volume>, <fpage>811</fpage>&#x02013;<lpage>821</lpage>. <pub-id pub-id-type="doi">10.1093/infdis/171.4.811</pub-id><pub-id pub-id-type="pmid">7706807</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jarrin</surname> <given-names>I.</given-names></name> <name><surname>Pantazis</surname> <given-names>N.</given-names></name> <name><surname>Dalmau</surname> <given-names>J.</given-names></name> <name><surname>Phillips</surname> <given-names>A. N.</given-names></name> <name><surname>Olson</surname> <given-names>A.</given-names></name> <name><surname>Mussini</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Does rapid HIV disease progression prior to combination antiretroviral therapy hinder optimal CD4&#x0002B; T-cell recovery once HIV-1 suppression is achieved?</article-title> <source>AIDS</source> <volume>29</volume>, <fpage>2323</fpage>&#x02013;<lpage>2333</lpage>. <pub-id pub-id-type="doi">10.1097/QAD.0000000000000805</pub-id><pub-id pub-id-type="pmid">26544704</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Khanlou</surname> <given-names>H.</given-names></name> <name><surname>Salmon-Ceron</surname> <given-names>D.</given-names></name> <name><surname>Sicard</surname> <given-names>D.</given-names></name></person-group> (<year>1996</year>). <article-title>Characteristics of rapid progressors in HIV infection</article-title>. <source>Ann. Med. Interne</source> (<publisher-loc>Paris</publisher-loc>). <volume>148</volume>, <fpage>163</fpage>&#x02013;<lpage>166</lpage>. <pub-id pub-id-type="pmid">9238442</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Korber</surname> <given-names>B.</given-names></name> <name><surname>Muldoon</surname> <given-names>M.</given-names></name> <name><surname>Theiler</surname> <given-names>J.</given-names></name> <name><surname>Gao</surname> <given-names>F.</given-names></name> <name><surname>Gupta</surname> <given-names>R.</given-names></name> <name><surname>Lapedes</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2000</year>). <article-title>Timing the ancestor of the HIV-1 pandemic strains</article-title>. <source>Science</source> <volume>288</volume>, <fpage>1789</fpage>&#x02013;<lpage>1796</lpage>. <pub-id pub-id-type="doi">10.1126/science.288.5472.1789</pub-id><pub-id pub-id-type="pmid">10846155</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname> <given-names>P.</given-names></name></person-group> (<year>2013</year>). <article-title>Long term non-progressor (LTNP) HIV infection</article-title>. <source>Indian J. Med. Res.</source> <volume>138</volume>:<fpage>291</fpage>. <pub-id pub-id-type="pmid">24135172</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname> <given-names>S.</given-names></name> <name><surname>Stecher</surname> <given-names>G.</given-names></name> <name><surname>Tamura</surname> <given-names>K.</given-names></name></person-group> (<year>2016</year>). <article-title>MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets</article-title>. <source>Mol. Biol. Evol.</source> <volume>33</volume>, <fpage>1870</fpage>&#x02013;<lpage>1874</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msw054</pub-id><pub-id pub-id-type="pmid">27004904</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kunec</surname> <given-names>D.</given-names></name> <name><surname>Osterrieder</surname> <given-names>N.</given-names></name></person-group> (<year>2016</year>). <article-title>Codon pair bias is a direct consequence of dinucleotide bias</article-title>. <source>Cell Rep.</source> <volume>14</volume>, <fpage>55</fpage>&#x02013;<lpage>67</lpage>. <pub-id pub-id-type="doi">10.1016/j.celrep.2015.12.011</pub-id><pub-id pub-id-type="pmid">26725119</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lemey</surname> <given-names>P.</given-names></name> <name><surname>Pond</surname> <given-names>S. L. K.</given-names></name> <name><surname>Drummond</surname> <given-names>A. J.</given-names></name> <name><surname>Pybus</surname> <given-names>O. G.</given-names></name> <name><surname>Shapiro</surname> <given-names>B.</given-names></name> <name><surname>Barroso</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>Synonymous substitution rates predict HIV disease progression as a result of underlying replication dynamics</article-title>. <source>PLoS Comput. Biol.</source> <volume>3</volume>:<fpage>e29</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.0030029</pub-id><pub-id pub-id-type="pmid">17305421</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liang</surname> <given-names>H.</given-names></name> <name><surname>Zhou</surname> <given-names>W.</given-names></name> <name><surname>Landweber</surname> <given-names>L. F.</given-names></name></person-group> (<year>2006</year>). <article-title>SWAKK: a web server for detecting positive selection in proteins using a sliding window substitution rate analysis</article-title>. <source>Nucleic Acids Res.</source> <volume>34</volume>, <fpage>W382</fpage>&#x02013;<lpage>W384</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkl272</pub-id><pub-id pub-id-type="pmid">16845032</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Markham</surname> <given-names>R. B.</given-names></name> <name><surname>Wang</surname> <given-names>W.-C.</given-names></name> <name><surname>Weisstein</surname> <given-names>A. E.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Munoz</surname> <given-names>A.</given-names></name> <name><surname>Templeton</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>1998</year>). <article-title>Patterns of HIV-1 evolution in individuals with differing rates of CD4 T cell decline</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>95</volume>, <fpage>12568</fpage>&#x02013;<lpage>12573</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.95.21.12568</pub-id><pub-id pub-id-type="pmid">9770526</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McCutchan</surname> <given-names>F. E.</given-names></name></person-group> (<year>2006</year>). <article-title>Global epidemiology of HIV</article-title>. <source>J. Med. Virol.</source> <volume>78</volume>, <fpage>S7</fpage>&#x02013;<lpage>S12</lpage>. <pub-id pub-id-type="doi">10.1002/jmv.20599</pub-id><pub-id pub-id-type="pmid">16622870</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Michael</surname> <given-names>N. L.</given-names></name></person-group> (<year>1999</year>). <article-title>Host genetic influences on HIV-1 pathogenesis</article-title>. <source>Curr. Opin. Immunol.</source> <volume>11</volume>, <fpage>466</fpage>&#x02013;<lpage>474</lpage>. <pub-id pub-id-type="pmid">10448150</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moratorio</surname> <given-names>G.</given-names></name> <name><surname>Iriarte</surname> <given-names>A.</given-names></name> <name><surname>Moreno</surname> <given-names>P.</given-names></name> <name><surname>Musto</surname> <given-names>H.</given-names></name> <name><surname>Cristina</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>A detailed comparative analysis on the overall codon usage patterns in West Nile virus</article-title>. <source>Infect. Genet. Evol.</source> <volume>14</volume>, <fpage>396</fpage>&#x02013;<lpage>400</lpage>. <pub-id pub-id-type="doi">10.1016/j.meegid.2013.01.001</pub-id><pub-id pub-id-type="pmid">23333335</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nakamura</surname> <given-names>Y.</given-names></name> <name><surname>Gojobori</surname> <given-names>T.</given-names></name> <name><surname>Ikemura</surname> <given-names>T.</given-names></name></person-group> (<year>2000</year>). <article-title>Codon usage tabulated from international DNA sequence databases: status for the year 2000</article-title>. <source>Nucleic Acids Res.</source> <volume>28</volume>:<fpage>292</fpage>. <pub-id pub-id-type="doi">10.1093/nar/28.1.292</pub-id><pub-id pub-id-type="pmid">10592250</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nasrullah</surname> <given-names>I.</given-names></name> <name><surname>Butt</surname> <given-names>A. M.</given-names></name> <name><surname>Tahir</surname> <given-names>S.</given-names></name> <name><surname>Idrees</surname> <given-names>M.</given-names></name> <name><surname>Tong</surname> <given-names>Y.</given-names></name></person-group> (<year>2015</year>). <article-title>Genomic analysis of codon usage shows influence of mutation pressure, natural selection, and host features on Marburg virus evolution</article-title>. <source>BMC Evol. Biol.</source> <volume>15</volume>:<fpage>174</fpage>. <pub-id pub-id-type="doi">10.1186/s12862-015-0456-4</pub-id><pub-id pub-id-type="pmid">26306510</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nei</surname> <given-names>M.</given-names></name> <name><surname>Gojobori</surname> <given-names>T.</given-names></name></person-group> (<year>1986</year>). <article-title>Simple methods for estimating the numbers of synonymous and non-synonymous nucleotide substitutions</article-title>. <source>Mol. Biol. Evol.</source> <volume>3</volume>, <fpage>418</fpage>&#x02013;<lpage>426</lpage>. <pub-id pub-id-type="pmid">3444411</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Peden</surname> <given-names>J. F.</given-names></name></person-group> (<year>2000</year>). <source>Analysis of Codon Usage</source>. <publisher-loc>Nottingham</publisher-loc>: <publisher-name>University of Nottingham</publisher-name>.</citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rangel</surname> <given-names>H. R.</given-names></name> <name><surname>Weber</surname> <given-names>J.</given-names></name> <name><surname>Chakraborty</surname> <given-names>B.</given-names></name> <name><surname>Gutierrez</surname> <given-names>A.</given-names></name> <name><surname>Marotta</surname> <given-names>M. L.</given-names></name> <name><surname>Mirza</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2003</year>). <article-title>Role of the human immunodeficiency virus type 1 envelope gene in viral fitness</article-title>. <source>J. Virol.</source> <volume>77</volume>, <fpage>9069</fpage>&#x02013;<lpage>9073</lpage>. <pub-id pub-id-type="doi">10.1128/JVI.77.16.9069-9073.2003</pub-id><pub-id pub-id-type="pmid">12885922</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reitter</surname> <given-names>J. N.</given-names></name> <name><surname>Means</surname> <given-names>R. E.</given-names></name> <name><surname>Desrosiers</surname> <given-names>R. C.</given-names></name></person-group> (<year>1998</year>). <article-title>A role for carbohydrates in immune evasion in AIDS</article-title>. <source>Nat. Med.</source> <volume>4</volume>, <fpage>679</fpage>&#x02013;<lpage>684</lpage>. <pub-id pub-id-type="doi">10.1038/nm0698-679</pub-id><pub-id pub-id-type="pmid">9623976</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ross</surname> <given-names>H. A.</given-names></name> <name><surname>Rodrigo</surname> <given-names>A. G.</given-names></name></person-group> (<year>2002</year>). <article-title>Immune-mediated positive selection drives human immunodeficiency virus type 1 molecular variation and predicts disease duration</article-title>. <source>J. Virol.</source> <volume>76</volume>, <fpage>11715</fpage>&#x02013;<lpage>11720</lpage>. <pub-id pub-id-type="doi">10.1128/JVI.76.22.11715-11720.2002</pub-id><pub-id pub-id-type="pmid">12388731</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roy</surname> <given-names>A.</given-names></name> <name><surname>Mukhopadhyay</surname> <given-names>S.</given-names></name> <name><surname>Sarkar</surname> <given-names>I.</given-names></name> <name><surname>Sen</surname> <given-names>A.</given-names></name></person-group> (<year>2015</year>). <article-title>Comparative investigation of the various determinants that influence the codon and amino acid usage patterns in the genus Bifidobacterium</article-title>. <source>World J. Microbiol. Biotechnol.</source> <volume>31</volume>, <fpage>959</fpage>&#x02013;<lpage>981</lpage>. <pub-id pub-id-type="doi">10.1007/s11274-015-1850-1</pub-id><pub-id pub-id-type="pmid">25842224</pub-id></citation></ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sandon&#x000ED;s</surname> <given-names>V.</given-names></name> <name><surname>Casado</surname> <given-names>C.</given-names></name> <name><surname>Alvaro</surname> <given-names>T.</given-names></name> <name><surname>Pernas</surname> <given-names>M.</given-names></name> <name><surname>Olivares</surname> <given-names>I.</given-names></name> <name><surname>Garc&#x000ED;a</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>A combination of defective DNA and protective host factors are found in a set of HIV-1 ancestral LTNPs</article-title>. <source>Virology</source> <volume>391</volume>, <fpage>73</fpage>&#x02013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.1016/j.virol.2009.05.022</pub-id><pub-id pub-id-type="pmid">19559455</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seo</surname> <given-names>T.-K.</given-names></name> <name><surname>Thorne</surname> <given-names>J. L.</given-names></name> <name><surname>Hasegawa</surname> <given-names>M.</given-names></name> <name><surname>Kishino</surname> <given-names>H.</given-names></name></person-group> (<year>2002</year>). <article-title>Estimation of effective population size of HIV-1 within a host: a pseudomaximum-likelihood approach</article-title>. <source>Genetics</source> <volume>160</volume>, <fpage>1283</fpage>&#x02013;<lpage>1293</lpage>. <pub-id pub-id-type="pmid">11973287</pub-id></citation></ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shackelton</surname> <given-names>L. A.</given-names></name> <name><surname>Parrish</surname> <given-names>C. R.</given-names></name> <name><surname>Holmes</surname> <given-names>E. C.</given-names></name></person-group> (<year>2006</year>). <article-title>Evolutionary basis of codon usage and nucleotide composition bias in vertebrate DNA viruses</article-title>. <source>J. Mol. Evol.</source> <volume>62</volume>, <fpage>551</fpage>&#x02013;<lpage>563</lpage>. <pub-id pub-id-type="doi">10.1007/s00239-005-0221-1</pub-id><pub-id pub-id-type="pmid">16557338</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharp</surname> <given-names>P. M.</given-names></name> <name><surname>Li</surname> <given-names>W.-H.</given-names></name></person-group> (<year>1986</year>). <article-title>An evolutionary perspective on synonymous codon usage in unicellular organisms</article-title>. <source>J. Mol. Evol.</source> <volume>24</volume>, <fpage>28</fpage>&#x02013;<lpage>38</lpage>. <pub-id pub-id-type="doi">10.1007/BF02099948</pub-id><pub-id pub-id-type="pmid">3104616</pub-id></citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharp</surname> <given-names>P. M.</given-names></name> <name><surname>Tuohy</surname> <given-names>T. M.</given-names></name> <name><surname>Mosurski</surname> <given-names>K. R.</given-names></name></person-group> (<year>1986</year>). <article-title>Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes</article-title>. <source>Nucleic Acids Res.</source> <volume>14</volume>, <fpage>5125</fpage>&#x02013;<lpage>5143</lpage>. <pub-id pub-id-type="doi">10.1093/nar/14.13.5125</pub-id><pub-id pub-id-type="pmid">3526280</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Simmonds</surname> <given-names>P.</given-names></name> <name><surname>Zhang</surname> <given-names>L. Q.</given-names></name> <name><surname>McOmish</surname> <given-names>F.</given-names></name> <name><surname>Balfe</surname> <given-names>P.</given-names></name> <name><surname>Ludlam</surname> <given-names>C. A.</given-names></name> <name><surname>Brown</surname> <given-names>A. J.</given-names></name></person-group> (<year>1991</year>). <article-title>Discontinuous sequence change of human immunodeficiency virus (HIV) type 1 env sequences in plasma viral and lymphocyte-associated proviral populations <italic>in vivo</italic>: implications for models of HIV pathogenesis</article-title>. <source>J. Virol.</source> <volume>65</volume>, <fpage>6266</fpage>&#x02013;<lpage>6276</lpage>. <pub-id pub-id-type="pmid">1920632</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Strunnikova</surname> <given-names>N.</given-names></name> <name><surname>Ray</surname> <given-names>S. C.</given-names></name> <name><surname>Livingston</surname> <given-names>R. A.</given-names></name> <name><surname>Rubalcaba</surname> <given-names>E.</given-names></name> <name><surname>Viscidi</surname> <given-names>R. P.</given-names></name></person-group> (<year>1995</year>). <article-title>Convergent evolution within the V3 loop domain of human immunodeficiency virus type 1 in association with disease progression</article-title>. <source>J. Virol.</source> <volume>69</volume>, <fpage>7548</fpage>&#x02013;<lpage>7558</lpage>. <pub-id pub-id-type="pmid">7494261</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>B.</given-names></name> <name><surname>Mikhail</surname> <given-names>M.</given-names></name> <name><surname>Dyer</surname> <given-names>W. B.</given-names></name> <name><surname>Zaunders</surname> <given-names>J. J.</given-names></name> <name><surname>Kelleher</surname> <given-names>A. D.</given-names></name> <name><surname>Saksena</surname> <given-names>N. K.</given-names></name></person-group> (<year>2003</year>). <article-title>First demonstration of a lack of viral sequence evolution in a non-progressor, defining replication-incompetent HIV-1 infection</article-title>. <source>Virology</source> <volume>312</volume>, <fpage>135</fpage>&#x02013;<lpage>150</lpage>. <pub-id pub-id-type="doi">10.1016/S0042-6822(03)00159-4</pub-id><pub-id pub-id-type="pmid">12890627</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weiss</surname> <given-names>R.</given-names></name></person-group> (<year>1993</year>). <article-title>How does HIV cause AIDS?</article-title> <source>Science</source> <volume>260</volume>, <fpage>1273</fpage>&#x02013;<lpage>1279</lpage>. <pub-id pub-id-type="pmid">8493571</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wong</surname> <given-names>E. H. M.</given-names></name> <name><surname>Smith</surname> <given-names>D. K.</given-names></name> <name><surname>Rabadan</surname> <given-names>R.</given-names></name> <name><surname>Peiris</surname> <given-names>M.</given-names></name> <name><surname>Poon</surname> <given-names>L. L. M.</given-names></name></person-group> (<year>2010</year>). <article-title>Codon usage bias and the evolution of influenza A viruses. Codon Usage Biases of Influenza Virus</article-title>. <source>BMC Evol. Biol.</source> <volume>10</volume>:<fpage>253</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2148-10-253</pub-id><pub-id pub-id-type="pmid">20723216</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wright</surname> <given-names>F.</given-names></name></person-group> (<year>1990</year>). <article-title>The effective number of codons used in a gene</article-title>. <source>Gene</source> <volume>87</volume>, <fpage>23</fpage>&#x02013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1016/0378-1119(90)90491-9</pub-id><pub-id pub-id-type="pmid">2110097</pub-id></citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wyatt</surname> <given-names>R.</given-names></name> <name><surname>Sodroski</surname> <given-names>J.</given-names></name></person-group> (<year>1998</year>). <article-title>The HIV-1 envelope glycoproteins: fusogens, antigens, and immunogens</article-title>. <source>Science</source> <volume>280</volume>, <fpage>1884</fpage>&#x02013;<lpage>1888</lpage>. <pub-id pub-id-type="pmid">9632381</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>Z.</given-names></name></person-group> (<year>2007</year>). <article-title>PAML 4: phylogenetic analysis by maximum likelihood</article-title>. <source>Mol. Biol. Evol.</source> <volume>24</volume>, <fpage>1586</fpage>&#x02013;<lpage>1591</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msm088</pub-id><pub-id pub-id-type="pmid">17483113</pub-id></citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yoshida</surname> <given-names>I.</given-names></name> <name><surname>Sugiura</surname> <given-names>W.</given-names></name> <name><surname>Shibata</surname> <given-names>J.</given-names></name> <name><surname>Ren</surname> <given-names>F.</given-names></name> <name><surname>Yang</surname> <given-names>Z.</given-names></name> <name><surname>Tanaka</surname> <given-names>H.</given-names></name></person-group> (<year>2011</year>). <article-title>Change of positive selection pressure on HIV-1 envelope gene inferred by early and recent samples</article-title>. <source>PLoS ONE</source> <volume>6</volume>:<fpage>e18630</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0018630</pub-id><pub-id pub-id-type="pmid">21526184</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeller</surname> <given-names>J. M.</given-names></name> <name><surname>McCain</surname> <given-names>N. L.</given-names></name> <name><surname>Swanson</surname> <given-names>B.</given-names></name></person-group> (<year>1996</year>). <article-title>Immunological and virological markers of HIV-disease progression</article-title>. <source>J. Assoc. Nurses AIDS Care</source> <volume>7</volume>, <fpage>15</fpage>&#x02013;<lpage>27</lpage>. <pub-id pub-id-type="doi">10.1016/S1055-3290(96)80034-3</pub-id><pub-id pub-id-type="pmid">8825177</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>J. H.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Sun</surname> <given-names>D. J.</given-names></name> <name><surname>Ma</surname> <given-names>Q.</given-names></name> <name><surname>Chen</surname> <given-names>H. T.</given-names></name> <name><surname>Ma</surname> <given-names>L. N.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>The distribution of synonymous codon choice in the translation initiation region of dengue virus</article-title>. <source>PLoS ONE</source> <volume>8</volume>:<fpage>e77239</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0077239</pub-id><pub-id pub-id-type="pmid">24204777</pub-id></citation></ref>
</ref-list>
</back>
</article>
