ORIGINAL RESEARCH article

Front. Microbiol., 27 May 2022

Sec. Virology

Volume 13 - 2022 | https://doi.org/10.3389/fmicb.2022.855232

HIV Transmembrane Glycoprotein Conserved Domains and Genetic Markers Across HIV-1 and HIV-2 Variants

  • HIV-1 Molecular Epidemiology Laboratory, Instituto Ramón y Cajal de Investigación Sanitaria (IRYCIS), Microbiology Department, Hospital Universitario Ramón y Cajal, CIBER en Epidemiología y Salud Pública (CIBERESP), Red en Investigación Translacional en Infecciones Pediátricas (RITIP), Madrid, Spain

Article metrics

View details

7

Citations

7,1k

Views

1,5k

Downloads

Abstract

HIV envelope transmembrane glycoproteins gp41 (HIV-1) and gp36 (HIV-2) present high variability and play a key role in the HIV-host cell membrane's fusion, as a target for human broadly neutralizing antibodies (bnAbs) and drugs. Thus, a better knowledge of amino acid (aa) conservation across structural domains and HIV variants can help to identify conserved targets to direct new therapeutic and diagnostic strategies. All available gp41/gp36 nucleotide sequences were downloaded from Los Alamos National Laboratory (LANL) HIV Sequence Database, selecting 17,078 sequences ascribed to HIV-1 and HIV-2 variants with ≥3 sequences. After aligning and translating into aa with MEGAv6.0, an in-house bioinformatics program (EpiMolBio) was used to identify the most conserved aa and the aa changes that were specific for each variant (V-markers) vs. HXB2/BEN (HIV-1/HIV-2) reference sequence. We analyzed the presence of specific aa changes among V-markers affecting infectivity, gp41 structure, function, or resistance to the enfuvirtide viral fusion inhibitor (T-20). We also inferred the consensus sequences per HIV variant, describing in each HIV-1 group (M, N, O, P) the conservation level along the complete gp41 per structural domain and locating in each binding site the anti-gp41 human Abs (bnAbs and non bnAbs) described in LANL. We found 38.3/59.7% highly conserved aa present in ≥90% of the 16,803/275 gp41/gp36 sequences ascribed to 105/3 HIV-1/HIV-2 variants, with 9/12.6% of them showing complete conservation across LANL sequences. The fusion peptide, its proximal region, the N-heptad repeat, and the membrane-proximal external region were the gp41 domains with ≥84% of conserved aa in the HIV-1 consensus sequence, the target of most Abs. No natural major resistance mutations to T-20 were observed. Our results show, for the first time, a complete conservation study of gp41/gp36 per variant in the largest panel of HIV variants analyzed to date, providing useful information for a more rational design of drugs, vaccines, and molecular detection tests targeting the HIV transmembrane glycoprotein.

Introduction

The human immunodeficiency virus (HIV) envelope transmembrane glycoproteins gp41 and gp36 are located on the HIV-1 and HIV-2 virion's membrane forming trimers with gp120 and gp105 glycoproteins, respectively. These proteins mediate the viral fusion with the host cell's membrane, allowing the entry of genetic material and viral proteins into the cells (Blumenthal et al., 2012). Therefore, they are important targets for the development of fusion inhibitors, such as antiretrovirals (ARV) (Qadir and Malik, 2010), antibodies (Abs) (Caillat et al., 2020), and aptamers (Li et al., 2016) used as HIV treatment. HIV-1 gp41, with 345 amino acids (aa), can be segmented into three domains (Figure 1A): one exposed ectodomain (aa 1-172), a transmembrane region (TM, aa 173-194), and an intraviral not exposed C-terminal domain (CT, aa 195-345). The ectodomain is exposed and can be divided further into distinct functional regions important for fusion and virus infectivity: an N-terminal hydrophobic region termed as fusion peptide (FP, aa 1-16), necessary to bind the virus to the cell membrane, followed by an N-terminal alpha-helical region or N-heptad repeat region (NHR, aa 33-70). These domains were linked by a fusion peptide proximal region (FPPR, aa 17-32) rich in polar aa and critical for HIV-1 fusion and infectivity because it stabilizes the envelope trimers (Lu et al., 2019). A loop immune-dominant linker with a disulfide bridge (IL, aa 71-113) links the NHR to a C-heptad repeat region (CHR, aa 114-153) (Figure 1B). A membrane-proximal external region (MPER, aa 154-172), conformationally flexible, connects the CHR to the TM region (Louis et al., 2016). The gp41 NHR domain is the target of ARV, such as enfuvirtide (T-20), the only clinically approved viral fusion inhibitor for the treatment of HIV infection (Lazzarin, 2005; Oldfield et al., 2005), and MPER for immunogens, as it contains epitopes to broadly neutralizing antibodies (bnAbs), such as 2F5, 4E10, Z13, and 10E8 (Los Alamos HIV Molecular Immunology Database, 2021a).

Figure 1

Figure 1

HIV-1 gp41 structural domains (A) and 3D model structure (B). Color code: light-purple (fusion peptide, FP), light-pink (fusion peptide proximal region, FPPR), blue (N-terminal alpha-helical region, NHR), green (immune-dominant linker, IL), orange (C-terminal heptad repeat region, CHR), dark-red (membrane-proximal external region, MPER), red (transmembrane region, TM), light-yellow (C-terminal domain, CT), and black (disulfide-bridged loop in IL, S-S). (A) Over-domains numbering according to HXB2 isolate aa and below-domains numbering according to HXB2 gp160 nucleotides (Genbank Accession number: K03455). (B) Gp41 3D model structure (PDB ID: 6OLP) (Berman et al., 2000; Rantalainen and Cottrell, 2019; Torrents de la Peña et al., 2019).

HIV is one of the most genetically diverse pathogens due to its high mutation and recombination rates, large population size, and rapid replication rate (Hemelaar, 2012). The HIV epidemic is the result of two types of viruses: HIV-1 and HIV-2, which are closely related to SIVcpz (Gao et al., 1999) and SIVsm (Gao et al., 1992), respectively. HIV-1 causes most of the HIV infections worldwide and has been divided according to genetic homology into four groups: M (major or main), N (non-M, non-O) (Simon et al., 1998), O (outlier) (De Leys et al., 1990), and P (Plantier et al., 2009). However, the global HIV epidemic is related to group M (Hemelaar et al., 2019), which has been subdivided into 10 subtypes (A–D, F–H, J–L) and eight sub-subtypes (A1, A2, A3, A4, A5, A6, F1, F2) (Robertson et al., 2000; Salminen, 2000; Leitner et al., 2005; Yamaguchi et al., 2020), at least 118 circulating recombinant forms (CRF) (Los Alamos HIV Sequence Database, 2021a) and uncountable unique recombinant forms (URF). HIV-2 has been classified into nine groups (A-I) and two recombinants (CRF01_AB and URF) (Visseaux et al., 2016).

Since the HIV transmembrane glycoprotein is a key target for human bnAbs and anti-HIV drugs, a better knowledge of aa conservation across structural domains and HIV variants can help to identify conserved targets to direct new therapeutic and diagnostic strategies. Furthermore, each HIV variant presents natural polymorphisms and unique aa changes (V-markers) along the viral genome fixed during viral evolution (Arenas et al., 2016), which have not been described to date in each HIV variant, mainly in HIV-1 group M non-B subtypes and recombinants, which are the majority in the pandemic (Hemelaar et al., 2019), and in HIV-2.

We present, for the first time, the most conserved gp41 domains in each HIV-1 variant per structural domain and anti-gp41 antibody binding domains in the largest panel of HIV-1 variants analyzed to date, identifying the V-markers and the consensus transmembrane glycoprotein sequence for each HIV variant (type, group, subtype, sub-subtype, and CRF).

Materials and Methods

HIV Transmembrane Glycoprotein Sequences

In October and November of 2020, we downloaded all available gp41 (HIV-1, 345 aa) and gp36 (HIV-2, 350 aa) nucleotides sequences from Los Alamos National Laboratory (LANL) HIV Sequence Database (Los Alamos HIV Sequence Database, 2021b) selecting one sequence per patient and grouping them per HIV variant (types, groups, subtypes, sub-subtypes, and CRF). URF sequences were not included in this study. They were aligned, edited, and translated into aa with the MEGAv6.0 program (Molecular Evolutionary Genetics Analysis: https://www.megasoftware.net/) (Tamura et al., 2013). MUSCLE function (Multiple Sequence Comparison by Log Expectation) (Edgar, 2004) was used for alignments with HXB2 (HIV-1 subtype B, GenBank accession number: K03455) or BEN (HIV-2 subtype A, GenBank accession number: M30502) reference sequences, removing nucleotides insertions. Sequences with stop codons in unusual positions and groups, subtypes, sub-subtypes, and CRF with <3 sequences were excluded from the study, except group P, as it was necessary to establish the HIV-1 aa sequence consensus.

Gp41/Gp36 aa Conservation and Inferred Consensus Sequences

Using a bioinformatics tool developed in our laboratory (EpiMolBio program), we analyzed the gp41/gp36 aa conservation of HIV variants with at least three available sequences (except group P, with two sequences). We also inferred the aa consensus sequences for gp41 HIV-1/gp36 HIV-2 and each HIV-1/HIV-2 variant, providing the most conserved aa in each residue.

The new EpiMolBio bioinformatics tool reported the percentage of sequences with a conserved aa in each position of any protein, establishing a color code for this study depending on the frequency of each conserved aa in gp41/gp36: white (<90%), light-blue/light-pink (≥90– <100%), and dark blue-green/fuchsia (100% or complete conservation across considered sequences), respectively. We also studied the level of conserved aa per gp41 structural domain in each HIV-1 non-M group, group M, and HIV-1 consensus sequences. For the analysis, we summed the aa conservation percentages (percentages of most conserved aa) of gp41 residues involved in each secondary structural domain and then divided the sum by the total number of residues per domain.

Finally, we used WebLogo (https://weblogo.berkeley.edu/logo.cgi) (Crooks et al., 2004) to generate a figure showing the HIV-1 and HIV-1 group M gp41, as well as the HIV-2 gp36 consensus sequences, including in each protein position the most frequent aa present in the corresponding alignment. The HIV-1 group M gp41 consensus sequence was generated after the alignment of group M variants consensus sequences, the HIV-1 gp41 consensus sequence was generated after the alignment of the HIV-1 groups (M, N, O, P) consensus sequences, and the HIV-2 gp36 consensus sequence was generated after the alignment of the HIV-2 groups A, B, and the CRF01_AB consensus sequences. The aa letters were represented as large as their conservation percentage with a color code according to their side-chain: in black, non-polar aliphatic (glycine, G; alanine, A; valine, V; leucine, L; methionine, M; isoleucine, I); in dark blue-green, aromatic (phenylalanine, F; tyrosine, Y; tryptophan, W); in fuchsia, polar uncharged (serine, S; threonine, T; cysteine, C; proline, P; asparagine, N; glutamine; Q); in light-blue, positively charged (lysine, K; arginine, R; histidine, H); and in light-pink, negatively charged (aspartic acid, D; glutamic acid, E). Deletions were represented by a yellow “X”.

HIV-1 Monoclonal Human Antibodies Location

We analyzed the overall conservation in each gp41 secondary-structure domain across the four HIV-1 groups and in the HIV-1 consensus sequence. We also identified the aa conservation level in each HIV-1 group and each anti-gp41 human Abs (bnAbs and non-bnAbs) binding domain described in the LANL HIV Immunology Database (Los Alamos HIV Molecular Immunology Database, 2021a), showing lineal epitopes in blue and non-linear epitopes recognized by bnAbs in orange.

Gp41/Gp36 Natural Polymorphisms and V-Markers Across HIV Variants

We described the gp41/gp36 natural polymorphisms and V-markers in HIV-1 and HIV-2 variants using the EpiMolBio program. To identify the aa changes present in ≥90% sequences in each HIV-1 or HIV-2 variant (natural polymorphisms), we compared all gp41 sequences with the HXB2 isolate, HIV-1 consensus, and HIV-1 group M consensus sequences, and all gp36 sequences with BEN isolate, and the HIV-2 consensus sequence. Among the natural polymorphisms found, we identified the exclusive V-markers of each HIV-1 or HIV-2 variant, not present in any other HIV variant. The color code for that analysis was light-blue (≥90– <100%) and dark blue-green (100%) for HIV-1 and light-pink (≥90– <100%) and fuchsia (100%) for HIV-2.

We also studied the presence of major T-20 resistance mutations according to the 2019 edition of the International Antiviral Society–USA (2019 IAS-USA) drug resistance mutations list (Wensing et al., 2019) among the V-markers found. Moreover, we looked for L44M change due to association with 1.8-fold resistance to T-20 in vitro (Mink et al., 2005).

Furthermore, we examined the presence of natural polymorphisms and specific V-markers on four gp41 positions (N160, W161, F162, and W169), described as key for HIV-1 neutralization by 10E8 (Huang et al., 2012; Kwon et al., 2016), which is a highly potent bnAb-recognizing gp41 MPER (epitope NWFDISNWLWYIK, gp41 positions 160-172) (Los Alamos HIV Molecular Immunology Database, 2021b). Other key reasons to study 10E8 bnAbs were that it was not detected by some serological diagnostic tests targeting gp41 (Smith et al., 2021) and was recently used to design new strategies for the development of a more efficient HIV-1 vaccine (Kuchar et al., 2021).

Finally, some gp41 changes (S23P, S23A, T25A, T27A, and I48P) affecting infectivity, gp41 structure or function (Alsahafi et al., 2015; Lu et al., 2019), were also studied across HIV-1 variants.

Results

Analyzed Gp41/Gp36 Sequences and Inferred Consensus Sequences

We downloaded all 18,348 HIV transmembrane glycoprotein sequences from the LANL database. Once sequences with stop codons in unusual positions were discarded and after excluding variants with <3 sequences (except HIV-1 group P), a total of 17,078 gp41/gp36 sequences from 108 variants, including types, groups, subtypes, sub-subtypes, and CRF, were finally used in this study: 16,803 gp41 (HIV-1, 105 variants) and 275 gp36 (HIV-2, three variants) (Table 1). Among the HIV-1 gp41 sequences, 99 belonged to non-M groups (N, O, and P) and 16,704 were ascribed to group M (nine subtypes, six sub-subtypes, and 87 CRF). The gp41 sequences from group M sub-subtype A5, subtype F, CRF30_0206, CRF84_A1D, CRF91_01C, CRF94_cpx, CRF97_01B, CRF101_01B, and CRF102_0107 were not available in LANL. Gp36 sequences from groups E, H, and I were also absent. The variants with the highest representation in HIV-1 group M were subtype B (48.5%), subtype C (23.9%), and recombinant CRF01_AE (13.1%). In HIV-2, the most represented group was A (85.5%).

Table 1

VariantsN°SEQS
N11
HIV-1Non-M GroupsO86
HIV-1Group MSubtypesA20
A1752
A28
A33
A6167
B8106
C3985
D183
F1122
F215
G153
H11
J8
K3
L3
HIV-1Group MCRFCRF01_AE2186
CRF02_AG291
CRF03_AB5
CRF04_cpx8
CRF05_DF4
CRF06_cpx14
CRF07_BC126
CRF08_BC62
CRF09_cpx5
CRF10_CD3
CRF11_cpx26
CRF12_BF18
CRF13_cpx10
CRF14_BG12
CRF15_01B9
CRF16_A2D4
CRF17_BF6
CRF18_cpx5
CRF19_cpx5
CRF20_BG4
CRF21_A2D4
CRF22_01A113
CRF24_BG4
CRF25_cpx5
CRF26_A5U5
CRF27_cpx4
CRF28_BF5
CRF29_BF7
HIV-1Group MCRFCRF31_BC3
CRF32_06A63
CRF33_01B7
CRF34_01B3
CRF35_AD21
CRF36_cpx3
CRF37_cpx4
CRF39_BF3
CRF40_BF4
CRF41_CD3
CRF42_BF13
CRF43_02G5
CRF44_BF3
CRF45_cpx5
CRF46_BF8
CRF47_BF3
CRF48_01B3
CRF49_cpx5
CRF50_A1D4
CRF51_01B7
CRF52_01B3
CRF53_01B4
CRF54_01B3
CRF55_01B9
CRF56_cpx4
CRF57_BC7
CRF58_01B6
CRF59_01B8
CRF60_BC5
CRF61_BC3
CRF62_BC3
CRF63_02A15
CRF64_BC8
CRF65_cpx6
CRF66_BF3
CRF68_01B3
CRF69_01B7
CRF70_BF3
CRF71_BF15
CRF72_BF5
CRF74_01B3
CRF75_BF3
CRF77_cpx4
CRF78_cpx3
CRF79_01073
HIV-1Group MCRFCRF82_cpx6
CRF84_A1D0
CRF85_BC11
CRF86_BC3
CRF87_cpx3
CRF88_BC3
CRF89_BF3
CRF90_BF16
CRF92_C2U5
CRF93_cpx3
CRF95_02B5
CRF96_cpx3
1
CRF100_01C3
CRF102_01070
CRF103_01B4
HIV-2GroupsA235
B34
HIV-2CRFCRF01_AB6

Gp41/gp36 LANL sequences were analyzed in this study.

In red, variants with <3 sequences. Only HIV-1 group P, with 2 sequences, was included in the study (with an asterisk). N°, number; SEQS, sequences; CRF, circulating recombinant forms.

Consensus sequences at aa level were inferred by EpiMolBio for HIV-1 group M (Figure 2A), HIV-1 (Figure 2B), and HIV-2 (Figure 2C) to study the homology at aa level across variants. The gp41 HIV-1 consensus sequence was generated after aligning the four HIV-1 groups (M, N, O, P) consensus sequences and the HIV-1 group M consensus after aligning 102 group M variants with at least three sequences. The gp36 HIV-2 consensus sequence was inferred after aligning the 275 gp36 LANL sequences ascribed to groups A, B, and CRF01_AB. Supplementary Tables 1, 2 show the inferred HIV-1 and HIV-2 transmembrane consensus sequences in this study, respectively.

Figure 2

Figure 2

Inferred consensus transmembrane sequences for HIV-1 group M (A), HIV-1 (B), and HIV-2 (C). Each residue includes the most frequent aa present in the corresponding alignment. The aa code letters were represented in proportion to their percentage of conservation. Color code according to their side-chain: non-polar aliphatic (glycine, G; alanine, A; valine, V; leucine, L; methionine, M; isoleucine, I) in black; aromatic (phenylalanine, F; tyrosine, Y; tryptophan, W) in dark blue-green; polar uncharged (serine, S; threonine, T; cysteine, C; proline, P; asparagine, N; glutamine, Q) in fuchsia; positively charged (lysine, K; arginine, R; histidine, H) in light-blue, and negatively charged (aspartic acid, D; glutamic acid, E) in light-pink. Deletions were represented by “X” in yellow.

Amino Acid Conservation of Gp41 and Gp36 Across HIV Variants

We identified the gp41 and gp36 residues with ≥90% and 100% conservation across sequences from each analyzed variant (Table 2). The gp41 HIV-1 consensus aa sequence had 38.3% of the 345 gp41 residues conserved in ≥90% of gp41 sequences, and 9% of aa were 100% conserved. Higher conservation of gp36 HIV-2 aa consensus sequence was observed compared to HIV-1 gp41 aa consensus sequence, with 59.7% of 350 gp36 residues conserved in ≥90% of gp36 sequences, and 12.6% of aa totally conserved. Thus, 4 out of 10 aa positions were ≥90% conserved in the HIV-1 gp41 consensus sequence, rising to 6 out of 10 in the HIV-2 gp36 consensus sequence.

Table 2

Conservation
≥90%100%
VariantsSEQSN°AA%N°AA%
HIV-1HIV-1 Consensus413238.3319
N1129685.825273
Non-M GroupsO8621562.311633.6
P230688.730688.7
HIV-1Group M Consensus10220860.3267.5
Group MSubtypesA2027278.818854.5
A175224169.96017.4
A2823668.423668.4
A3329685.829685.8
A616728783.211232.5
B810624771.6226.4
C398524069.6113.2
D18325774.55816.8
F112226376.29527.5
F21524169.919957.7
G15324771.69026.1
H1127479.420459.1
J824169.924169.9
K328983.828983.8
L32768027680
HIV-1Group MCRFCRF01_AE218626075.44312.5
CRF02_AG29126075.46619.1
CRF03_AB531190.131190.1
CRF04_cpx824972.224972.2
CRF05_DF427078.327078.3
CRF06_cpx1425573.920459.1
CRF07_BC12628682.914241.2
CRF08_BC6228181.417350.1
CRF09_cpx529084.129084.1
CRF10_CD329184.329184.3
CRF11_cpx2624671.317049.3
CRF12_BF182457118553.6
CRF13_cpx1028582.623668.4
CRF14_BG1230688.726877.7
CRF15_01B924370.424370.4
CRF16_A2D427379.127379.1
CRF17_BF624671.324671.3
CRF18_cpx527078.327078.3
CRF19_cpx527178.627178.6
CRF20_BG429184.329184.3
CRF21_A2D427278.827278.8
CRF22_01A11326677.120258.6
CRF24_BG429986.729986.7
CRF25_cpx527078.327078.3
CRF26_A5U525774.525774.5
CRF27_cpx425473.625473.6
CRF28_BF526877.726877.7
CRF29_BF723668.423668.4
CRF31_BC329184.329184.3
CRF32_06A633219332193
CRF33_01B726677.126677.1
CRF34_01B332794.832794.8
CRF35_AD2128983.821963.5
CRF36_cpx330488.130488.1
CRF37_cpx427780.327780.3
CRF39_BF32838228382
CRF40_BF42768027680
CRF41_CD334198.834198.8
CRF42_BF1332293.330187.2
CRF43_02G527780.327780.3
CRF44_BF328682.928682.9
CRF45_cpx526576.826576.8
CRF46_BF822164.122164.1
CRF47_BF329585.529585.5
CRF48_01B329986.729986.7
CRF49_cpx526877.726877.7
CRF50_A1D429886.429886.4
CRF51_01B728682.928682.9
CRF52_01B330287.530287.5
CRF53_01B428883.528883.5
CRF54_01B331089.931089.9
CRF55_01B928281.728281.7
CRF56_cpx43389833898
CRF57_BC724771.624771.6
CRF58_01B629485.229485.2
CRF59_01B829585.529585.5
CRF60_BC528883.528883.5
CRF61_BC333095.733095.7
CRF62_BC332293.332293.3
CRF63_02A1530287.525975.1
CRF64_BC82527325273
CRF65_cpx629485.229485.2
CRF66_BF328281.728281.7
CRF68_01B332092.832092.8
CRF69_01B728081.228081.2
CRF70_BF328682.928682.9
CRF71_BF1525373.321261.4
CRF72_BF525774.525774.5
CRF74_01B33008730087
CRF75_BF331691.631691.6
CRF77_cpx430488.130488.1
CRF78_cpx327980.927980.9
CRF79_0107331892.231892.2
CRF82_cpx629986.729986.7
CRF83_cpx1131992.529886.4
HIV-1Group MCRFCRF85_BC1130889.325172.8
CRF86_BC330588.430588.4
CRF87_cpx331290.431290.4
CRF88_BC330387.830387.8
CRF89_BF329084.129084.1
CRF90_BF1625874.825874.8
CRF92_C2U523467.823467.8
CRF93_cpx327780.327780.3
CRF95_02B529485.229485.2
CRF96_cpx330588.430588.4
CRF100_01C329384.929384.9
CRF103_01B430287.530287.5
HIV-2HIV-2 Consensus320959.74412.6
GroupsA235231665515.7
B3425171.715945.4
CRFCRF01_AB629082.929082.9

Number and percentage of highly (≥90%, 100%) conserved aa per HIV-1/HIV-2 variant.

Data showed the number of aa conserved in ≥90 or 100% of sequences per analyzed variant. Gp41 (HIV-1), 345 aa; gp36 (HIV-2), 350 aa. Gp41 HIV-1 consensus sequence was generated after the alignment of the four HIV-1 groups (M, N, O, P) consensus sequences, the HIV-1 group M consensus after the alignment of 102 group M variants with ≥3 sequences, and the HIV-2 consensus after the alignment of three HIV-2 variants. N°, number; SEQS, sequences; AA, amino acids; %, aa percentage; CRF, circulating recombinant forms.

For HIV-1 groups, the highest percentage of conserved aa in ≥90%/100% of their HIV-1 gp41 LANL sequences were found in groups P (88.7%/88.7%) and N (85.8%/73%), followed by O (62.3%/33.6%) and M (60.3%/7.5%). Therefore, group P was the HIV-1 non-M group with the highest number of highly conserved gp41 residues, while group O had the fewest.

Among the 16,704 group M gp41 sequences, CRF46_BF had the lowest number of aa present in ≥90% of sequences (64.1%, eight sequences) and CRF41_CD the highest (98.8%, three sequences). When considering the completely conserved (100%) residues, subtype C showed the lowest number (3.2%, 3,985 sequences) and CRF41_CD the highest (98.8%, three sequences). Thus, among group M variants, CRF41_CD was the most conserved variant, while subtype C was the least conserved.

Regarding HIV-2, the variant with the highest percentage of ≥90%/100% conserved aa in gp36 protein was CRF01_AB (82.9%/82.9%), followed by group B (71.7%/45.4%) and group A (66%/15.7%).

Gp41 Conservation in Each HIV-1 Group per Structural Domain and Anti-Gp41 Antibody Binding Sites

When we studied the level of conserved aa in each gp41 structural domain, we observed that gp41 conservation differed between HIV-1 groups and structural domains (Table 3), ranging from 66.4% to 98.9% conservation. The gp41 domain in the HIV-1 consensus sequence with the highest conservation was NHR (86.2%), followed by FPPR (84.9%), MPER (84.2%), and FP (84%). We observed a high conservation percentage in FPPR (96%) and NHR (95.4%) domains in the group M consensus sequence, despite the lower overall gp41 conservation vs. the other three HIV-1 groups. Group N gp41 presented the most conserved FP (98.9%) and FPPR (98.8%) domains, group O the best conserved MPER domain (95.7%), and group P the most conserved NHR (98.7%), TM (95.5%), IL (95.3%), and CT (93.7%) gp41 domains (Table 3). CHR domain presented the highest conservation (95.0%) in groups N and P. Considering all analyzed HIV-1 variants, the FP, FPPR, NHR, and MPER were the gp41 domains with the highest (≥84%) number of conserved aa, being the target for most anti-gp41 Abs. The least conserved domains in the HIV-1 consensus sequence were TM (69%) and CT (66.4%) located inside the virus.

Table 3

% AA Conservation HIV-1 GP41
Gp41 structural domainsFPFPPRNHRILCHRMPERTMCTGP41
Gp41 residues1–1617–3233–7071–113114–153154–172173–194195–3451–345
Group N98.998.896.595.095.090.994.693.594.6
Group O97.787.695.379.283.895.789.086.087.3
Group P93.890.698.795.395.089.595.593.794.3
Cons. Group M86.796.095.486.679.686.191.981.085.1
Cons. HIV-184.084.986.273.370.784.269.066.472.8

Percentage of aa conservation in each gp41 structural domain across HIV-1 groups and HIV-1 consensus sequences.

For the analysis, we used the aa conservation percentages per residue (percentages of most conserved aa) of each HIV-1 group and HIV-1 consensus sequence provided in Supplementary Table 1 with color code. After identifying the residues of each secondary structural domain, the sum of the aa conservation percentages of each of them was divided by the total number of residues per domain. In color, the highest conservation level in each structural domain. In bold, the most conserved structural domain in each HIV-1 group and HIV-1 consensus sequence. Gp41 residues according to HXB2 HIV-1 isolate. N°, number; AA, amino acids; Cons, aa consensus sequence; FP, fusion peptide; FPPR, fusion peptide proximal region; NHR, N-terminal alpha-helical region; IL, immune-dominant linker; CHR, C-terminal heptad repeat region; MPER, membrane-proximal external region; TM, transmembrane region; CT, C-terminal domain.

The conservation level in each domain for human anti-gp41 bnAbs and non-bnAbs across HIV-1 groups was also described (Figure 3), located between the FP and MPER domains since TM and CT (the less conserved gp41 domains according to our data) are not exposed. The bnAbs are directed to CHR (2F5) and mainly to the MPER domain (2F5, Z13, 4E10, 10E8, and derivatives). No anti-gp41 Ab-binding sites were shown in LANL in the first 13 aa of the FP domain, in the 17 first positions of NHR, in the last 9 residues of IL, and the first 14 aa of CHR. The same was true in the target peptide to the fusion inhibitor T-20 (GIVQQQNNLL, NHR, residues 36-45), even though it presented high conservation across HIV-1 groups (Figure 3).

Figure 3

Figure 3

Amino acid conservation level in gp41 residues involved in anti-gp41 human antibody binding domains described in LANL HIV Immunology Database across HIV-1 groups and HIV-1 consensus sequences. Data showed the percentage of the most conserved aa with the following conservation color code: white (aa conserved <90%), light-blue (aa conserved ≥90%– <100%), and dark blue-green (aa conserved 100%). Anti-gp41 human antibodies include bnAbs and non-bnAbs described in Los Alamos HIV Immune Database (Los Alamos HIV Molecular Immunology Database, 2021a). Linear epitopes are shown in blue and non-linear epitopes, recognized by broadly neutralizing antibodies (bnAbs), are shown in orange. We also indicated the T-20 fusion inhibitor binding domain in red (T-20BD). N°, number; AA, amino acids; SEQS, sequences; Cons, aa consensus sequence; FP, fusion peptide; FPPR, fusion peptide proximal region; NHR, N-terminal alpha-helical region; IL, immune-dominant linker; CHR, C-terminal heptad repeat region; MPER, membrane-proximal external region; TM, transmembrane region; CT, C-terminal domain; G, glycine; A, alanine; V, valine; L, leucine; M, methionine; I, isoleucine; F, phenylalanine; Y, tyrosine; W, tryptophan; S, serine; T, threonine; C, cysteine; P, proline; N, asparagine; Q, glutamine; K, lysine; R, arginine; H, histidine; D, aspartic acid; E, glutamic acid; poly, polyclonal.

Natural Polymorphisms and V-Markers

All-natural polymorphisms (aa present in ≥90% sequences) found in each HIV variant are shown in Supplementary Table 3 (HIV-1) and Supplementary Table 4 (HIV-2). The number of polymorphisms in gp41 across HIV-1 non-B variants (different from subtype B) increased when using the HXB2 subtype B sequence as reference. HIV-2 group A presented fewer polymorphisms in gp36 when HIV-2 BEN subtype A group was used as reference. Figure 4 shows the natural polymorphisms that could be considered as exclusive V-markers for HIV-1 gp41 non-M groups (86 V-markers, Figure 4A), for HIV-1 group M gp41 variants (120 V-markers, Figure 4B), and HIV-2 gp36 variants (24 V-markers, Figure 4C). The specific V-markers per variant can be found in Supplementary Tables 3, 4.

Figure 4

Figure 4

Location of exclusive transmembrane glycoprotein V-markers in non-M groups (A), HIV-1 group M variants (B), and HIV-2 variants (C). V-markers in gp41 HIV-1 non-M groups are described with respect to the HIV-1 consensus sequence (yellow color). V-markers in gp41 HIV-1 group M variants are described with respect to the HIV-1 group M consensus sequence (light-blue color). V-markers in gp36 HIV-2 variants are described with respect to the HIV-2 consensus sequence (light-pink color). These V-markers and their corresponding variant can be also found in Supplementary Table 3 (for HIV-1) and Supplementary Table 4 (for HIV-2). V-markers, aa changes that were specific for each variant present in ≥90% of gp41 or gp36 sequences of an exclusive HIV-1 or HIV-2 variant; N°, number; AA, amino acids; CONS, aa consensus sequence; G, glycine; A, alanine; V, valine; L, leucine; M, methionine; I, isoleucine; F, phenylalanine; Y, tyrosine; W, tryptophan; S, serine; T, threonine; C, cysteine; P, proline; N, asparagine; Q, glutamine; K, lysine; R, arginine; H, histidine; D, aspartic acid; E, glutamic acid; X, aa deletion.

No V-markers associated with T-20 major resistance were found in HIV-1 groups N, O, P, group M, and HIV-2 variants. Only M44 was present in the group P consensus gp41 sequence (Supplementary Tables 1, 3). Group O gp41 consensus sequence carried D42 V-marker, but no T42, a residue associated with high T-20 resistance.

After the analysis of the four gp41 key positions (N160, W161, F162, and W169) for HIV-1 neutralization by the anti-MPER bnAb 10E8, we observed high (90–100%) conservation of W161 and W169 in the transmembrane consensus sequences across HIV-1 variants. L162 appeared in group O and L/M162 in group P gp41 consensus sequence. Residue 160 showed the highest variability, carrying the group P and many HIV-1 group M variants another aa, mainly serine.

Finally, regarding changes in gp41 residues (A23, P23, A25, A27, and P48) affecting infectivity, gp41 structure, or function, we observed that A23 appeared in groups P and O and in CRF63_02A gp41 consensus sequences, while A25 appeared in group O, subtype L, CRF41_CD, and CRF47_BF group M variants. V48 appeared in the CRF90_BF1 gp41 consensus sequence but P48 did not.

Discussion

This is the most up-to-date descriptive study related to HIV-1/HIV-2 transmembrane envelope proteins, providing the conservation level and the V-markers in each HIV variant, identifying the conserved gp41 domains in each HIV-1 group per structural domain and per anti-gp41 antibody binding site in the most extensive panel of HIV-1 gp41/HIV-2 gp36 sequences (n = 17,078) and variants (105 HIV-1 variants, three HIV-2 variants) analyzed to date. Our in-house bioinformatics tool was also used to establish consensus sequences for gp41/gp36 proteins to study the aa level conservation across HIV variants.

The results have shown that the degree of conservation of the protein can differ across HIV variants and transmembrane structural domains, as our group previously described with fewer sequences and variants (Holguín et al., 2007). The higher level of conserved residues across variants in gp36 (275 sequences) vs. gp41 (16,803 sequences) could be explained by the lower number of gp36 sequences in LANL due to the lower HIV-2 prevalence and worldwide infections (1–2 of 38 million HIV infections) explained by its lower transmissibility and virulence vs. HIV-1 (Azevedo-Pereira and Santos-Costa, 2016; Clinical Info and gov, 2019; Kapoor and Padival, 2021). A higher level of fully conserved aa observed in HIV-1 non-M groups (99 sequences) vs. group M (16,704 sequences) could also be explained by the lower prevalence of groups N, O, and P in the pandemic (Mourez et al., 2013). The same happens in most CRF, with few available gp41 sequences in LANL, due to their low prevalence and the absence of gp41 sequencing in countries where they circulate, with sequencing or research not always available.

All gp41 secondary structure ectodomains presented >70% of conservation (HIV-1 consensus sequence), which supports their key role in the viral cycle and the importance of structure maintenance for virus-cell membranes fusion and viral entry (Pancera et al., 2014). Our data revealed that the best-conserved gp41 domains were N-HR (86.2%), FPPR (84.9%), MPER (84.2%), and FP (84%). It is essential to highlight that precisely NHR, FP, and FPPR gp41 domains have recently been implicated in the interaction with the fusion inhibitor T-20, which prevents the virus from entering the cell (Xu et al., 2019).

Regarding the gp41-CT region (aa 195-345), located inside the virus, it was the least conserved domain in the HIV-1 and group M gp41 consensus sequences, despite its important role in gp41 structure and function (Fernandez and Freed, 2018). CT domain is involved in envelope conformation (Castillo-Menendez et al., 2018), being essential for efficient envelope incorporation into budding HIV-1 particles (Murakami and Freed, 2000), and requiring interaction with gag matrix protein (MA) (Wyma et al., 2000; Eastep et al., 2021). However, the domains in each protein involved in this interaction are still unknown (Fernandez and Freed, 2018). A link between the matrix trimers' formation and the binding between MA and gp41 CT has also been reported (Alfadhli et al., 2016).

The critical role of the FPPR or polar region at the N terminus of gp41 for HIV-1 fusion and infectivity by stabilizing envelope trimers (Lu et al., 2019) could explain the high gp41 aa conservation percentage found in this domain. S23 within FPPR is structurally essential for maintaining HIV-1 envelope trimer, viral fusogenicity, and infectivity (Lu et al., 2019). Single or combined mutations S23P, T25A, and T27A in the FPPR region abolished or significantly decreased HIV-1 infectivity without affecting viral production, and S23A change significantly reduced HIV-1 infectivity and fusogenicity but not envelope expression and cleavage (Lu et al., 2019). In our study, only S23A and T25A substitutions appeared in a low number of variants with low prevalence in the pandemic. The absence of I48P gp41 change in our sequence set could be explained by its high impact on the quaternary conformation and function of the envelope glycoprotein trimer (Alsahafi et al., 2015).

Analyzing the genetic variability of HIV glycoprotein transmembrane within its immunodominant epitopes is important for understanding its possible impact on HIV Abs detection (Dorn et al., 2000; Dong et al., 2005; Smith et al., 2021). HIV-1 bnAbs can neutralize most HIV-1 strains from diverse genetic and geographic backgrounds (Binley et al., 2004; Wang and Zhang, 2020). The anti-gp41 bnAbs can recognize the MPER (Huang et al., 2012) and FP domains (Yuan et al., 2019), as well as the gp120/gp41 interphase (Huang et al., 2014; Scharf et al., 2014; Wang and Zhang, 2020). The conformational plasticity of FP could facilitate the recognition of the virus by bnAbs (Yuan et al., 2019). It is also known that HIV-1 variability can impact on bnAbs reactivity in HIV diagnostic tests targeting gp41, leading to non-reactive results with different serological diagnostic assays (Smith et al., 2021). Furthermore, the identification of HIV antibody binding domains is critical for vaccine development studies (Kuchar et al., 2021). Many bnAbs directed to gp41 have been described (Los Alamos HIV Molecular Immunology Database, 2021a), and the identification of the aa conservation level across HIV variants on the recognized epitopes by each bnAb and in those key gp41 residues for viral neutralization is of particular interest. The high variability found in key gp41 residue 160 across some HIV-1 variants could explain the previously reported failure of 10E8 recognition by some different serological diagnostic tests (Smith et al., 2021). We could not analyze the gp41 aa conservation in the gp41 target sequence per each failing diagnostic test because manufacturers do not provide detailed information regarding which part of the gp41 sequence was targeted in their HIV diagnostic assays detecting the transmembrane protein, which can also differ across assays. The provided information in the Supplementary Tables of the manuscript can help manufacturers and other researchers design new gp41-based molecular and serological diagnostic tests to identify those HIV-1 variants whose diagnosis could be compromised by viral genetic variability.

Natural infection by HIV-2 also leads to the elicitation of high titers of bnAbs against primary HIV-2 strains (De Silva et al., 2012; Kong et al., 2012; Özkaya Sahin et al., 2012), although not all bnAbs to HIV-2 neutralize HIV-1 variants (Björling et al., 1993). In fact, MPER-specific Abs induced by vaccination with recombinant gp36 proteins in rats did not neutralize HIV-2 (Behrendt et al., 2012). It is known that HIV types present different mechanisms for the processing of envelope glycoproteins from a smaller env precursor in HIV-2 (gp140) than in HIV-1 (gp160) (Rey et al., 1989). To date, unfortunately, the exact aa residues of each secondary structure in HIV-2 gp36 have not been specified. However, some conserved gp36 epitopes have been reported (Jadhav et al., 2011), as well as immunogenic sites, antibody binding sites in the TM and IL region of HIV-2 transmembrane gp36 (Chiodi et al., 1993), and other epitopes recognized by bnAbs in HIV-2 envelope gp140 (Kong et al., 2012).

Previous studies showed the importance of consensus sequence establishment to guide vaccine development (Ellenberger et al., 2002; Sliepen et al., 2019). For the first time, our study also provides the aa consensus sequence of the transmembrane glycoprotein in each HIV variant (type, group, subtype, sub-subtype, and CRF). Moreover, we showed the conservation level across their sequences, which could be helpful to look for highly conserved peptides to direct new ARV, Abs, aptamers, probes, or primers to control or diagnose HIV infection regardless of the HIV variant. Furthermore, we showed the first identification of specific natural polymorphisms of gp41 and gp36 that can be considered as V-markers for all HIV-1 and HIV-2 variants, which should be considered in the new strategies for developing HIV-1 vaccines based on epitopes recognized by bnAbs (Kuchar et al., 2021). The exclusive HIV V-markers identified in gp41/gp36 sequences could help in faster and preliminary HIV variant identification if required, before doing the phylogenetic study, the gold standard method for correct HIV variant characterization. New studies are required to evaluate the structural and biological impact of the different levels of aa conservation in gp41 across HIV-1 variants and the specific V-markers found in the viral transmembrane protein.

Long-term exposure to the first entry inhibitor T-20 induces drug-resistant mutations (Pérez-Alvarez et al., 2006). Interestingly, none of the variants had V-markers associated with major resistance to T-20, as we previously reported testing 79 different HIV variants from naïve patients (Holguín et al., 2007). Thus, no natural major resistance mutations to T-20 were observed. The L44M change found in group P consensus sequence, previously associated with 1.8-fold resistance to T-20 in vitro (Mink et al., 2005), was previously found in T-20 naïve subjects from China (Chang et al., 2021), maybe reflecting a resistance transmission during primoinfection. New HIV-1 fusion inhibitors are under development (Luque and Camarasa, 2021) and previous studies showed that optimized T-20 derivates could have been effective inhibitors of infection for multiple HIV-1 variants (Chen et al., 2019).

The main limitation of the study was the absence of LANL sequences from some HIV-1 (sub-subtype A5, subtype F, CRF30_0206, CRF84_A1D, CRF91_01C, CRF94_cpx, CRF97_01B, CRF101_01B, CRF102_0107) and HIV-2 variants (groups E, H, I), as well as the scarce number of HIV gp41/gp36 sequences in other 14 HIV variants with <3 sequences in LANL, which meant that they could not be included in the analysis (except group P).

The information provided in this manuscript aims to help other researchers studying the biological, therapeutical, diagnostic, or structural role of gp41 to identify the natural polymorphisms and specific V-markers per variant in each gp41/gp36 residue or epitope according to their interest. This study will also be useful for a more rational design of anti-gp41 drugs and vaccines and future HIV molecular diagnostic tests directed to transmembrane HIV protein.

Funding

This research and AV-A's contract were funded by Instituto de Salud Carlos III (PI18/00904 Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016) and co-financed by the European Regional Development Fund “A way to achieve Europe” (ERDF). RR was funded by FONDOS FUR 2020/0285. The funder had no role in the design of the study, in the collection, analyses, or interpretation of data, in the writing of the manuscript, or in the decision to publish the results.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Statements

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Files, further inquiries can be directed to the corresponding authors.

Author contributions

AV-A downloaded and analyzed the HIV LANL sequences under study, validated some EpiMolBio functions necessary for sequences analyses, performed the computations, discussed results, and wrote the first draft of the manuscript. RR developed the in-house EpiMolBio bioinformatics program, validated some EpiMolBio functions necessary for sequences analyses, discussed results, and reviewed the final version of the manuscript. ÁH designed and supervised the study, reviewed and edited the manuscript, funding application, and project administration. All authors approved the final version submitted.

Acknowledgments

We thank Paul Devlin for proofreading the manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2022.855232/full#supplementary-material

References

  • 1

    AlfadhliA.MackA.RitchieC.CylinderI.HarperL.TedburyP. R.et al. (2016). Trimer enhancement mutation effects on HIV-1 matrix protein binding activities. J. Virol. 90, 56575664. 10.1128/JVI.00509-16

  • 2

    AlsahafiN.DebbecheO.SodroskiJ.FinziA. (2015). Effects of the I559P gp41 change on the conformation and function of the human immunodeficiency virus (HIV-1) membrane envelope glycoprotein trimer. PLoS ONE10:e0122111. 10.1371/journal.pone.0122111

  • 3

    ArenasM.Lorenzo-RedondoR.Lopez-GalindezC. (2016). Influence of mutation and recombination on HIV-1 in vitro fitness recovery. Mol. Phylogenet. Evol. 94, 264270. 10.1016/j.ympev.2015.09.001

  • 4

    Azevedo-PereiraJ. M.Santos-CostaQ. (2016). HIV interaction with human host: HIV-2 as a model of a less virulent infection. AIDS Rev. 18, 44–53. Available online at: https://www.aidsreviews.com/resumen.php?id=1327&indice=2016181&u=unp

  • 5

    BehrendtR.FiebigU.KurthR.DennerJ. (2012). Induction of antibodies binding to the membrane proximal external region of gp36 of HIV-2. Intervirology55, 252256. 10.1159/000324483

  • 6

    BermanH. M.WestbrookJ.FengZ.GillilandG.BhatT. N.WeissigH.et al. (2000). The protein data bank. Nucleic Acids Res. 28, 235242. 10.1093/nar/28.1.235

  • 7

    BinleyJ. M.WrinT.KorberB.ZwickM. B.WangM.ChappeyC.et al. (2004). Comprehensive cross-clade neutralization analysis of a panel of anti-human immunodeficiency virus type 1 monoclonal antibodies. J. Virol. 78, 1323213252. 10.1128/JVI.78.23.13232-13252.2004

  • 8

    BjörlingE.ScarlattiG.von GegerfeltA.AlbertJ.BiberfeldG.ChiodiF.et al. (1993). Autologous neutralizing antibodies prevail in HIV-2 but not in HIV-1 infection. Virology193, 528530. 10.1006/viro.1993.1160

  • 9

    BlumenthalR.DurellS.ViardM. (2012). HIV entry and envelope glycoprotein-mediated fusion. J. Biol. Chem. 287, 4084140849. 10.1074/jbc.R112.406272

  • 10

    CaillatC.GuilligayD.SulbaranG.WeissenhornW. (2020). Neutralizing antibodies targeting HIV-1 gp41. Viruses12:E1210. 10.3390/v12111210

  • 11

    Castillo-MenendezL. R.WittK.EspyN.PrinciottoA.MadaniN.PachecoB.et al. (2018). Comparison of uncleaved and mature human immunodeficiency virus membrane envelope glycoprotein trimers. J. Virol. 92, e00277e00218. 10.1128/JVI.00277-18

  • 12

    ChangL.ZhaoJ.GuoF.JiH.ZhangL.JiangX.et al. (2021). HIV-1 gp41 genetic diversity and enfuvirtide resistance-associated mutations among enfuvirtide-naïve patients in southern China. Virus Res. 292:198215. 10.1016/j.virusres.2020.198215

  • 13

    ChenG.CookJ. D.YeW.LeeJ. E.SidhuS. S. (2019). Optimization of peptidic HIV-1 fusion inhibitor T20 by phage display. Protein. Sci. 28, 15011512. 10.1002/pro.3669

  • 14

    ChiodiF.BjörlingE.SamuelssonA.NorrbyE. (1993). Antigenic and immunogenic sites of HIV-2 glycoproteins. Chem. Immunol. 56, 6177. 10.1159/000319156

  • 15

    Clinical Info H. I. V.gov. (2019). HIV-2 Infection. Available online at: https://clinicalinfo.hiv.gov/es/node/9288 (accessed September 15, 2021).

  • 16

    CrooksG. E.HonG.ChandoniaJ. M.BrennerS. E. (2004). WebLogo: a sequence logo generator. Genome Res. 14, 11881190. 10.1101/gr.849004

  • 17

    De LeysR.VanderborghtB.Vanden HaeseveldeM.HeyndrickxL.van GeelA.WautersC.et al. (1990). Isolation and partial characterization of an unusual human immunodeficiency retrovirus from two persons of West-Central African origin. J. Virol. 64, 12071216. 10.1128/jvi.64.3.1207-1216.1990

  • 18

    De SilvaT. I.Aasa-ChapmanM.CottenM.HuéS.RobinsonJ.Bibollet-RucheF.et al. (2012). Potent autologous and heterologous neutralizing antibody responses occur in HIV-2 infection across a broad range of infection outcomes. J. Virol. 86, 930946. 10.1128/JVI.06126-11

  • 19

    DongX.-N.YingJ.WuY.ChenY.-H. (2005). Genetic variability of principal neutralizing determinants on HIV-1 gp41 and its correlation with subtypes. Immunol. Lett.101, 104107. 10.1016/j.imlet.2005.04.013

  • 20

    DornJ.MasciotraS.YangC.DowningR.BiryahwahoB.MastroT. D.et al. (2000). Analysis of genetic variability within the immunodominant epitopes of envelope gp41 from human immunodeficiency virus type 1 (HIV-1) group M and its impact on HIV-1 antibody detection. J. Clin. Microbiol. 38, 773780. 10.1128/JCM.38.2.773-780.2000

  • 21

    EastepG. N.GhanamR. H.GreenT. J.SaadJ. S. (2021). Structural characterization of HIV-1 matrix mutants implicated in envelope incorporation. J. Biol. Chem. 296:100321. 10.1016/j.jbc.2021.100321

  • 22

    EdgarR. C.. (2004). MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 17921797. 10.1093/nar/gkh340

  • 23

    EllenbergerD. L.LiB.LupoL. D.OwenS. M.NkengasongJ.Kadio-MorokroM. S.et al. (2002). Generation of a consensus sequence from prevalent and incident HIV-1 infections in West Africa to guide AIDS vaccine development. Virology302, 155163. 10.1006/viro.2002.1577

  • 24

    FernandezM. V.FreedE. O. (2018). Meeting Review: 2018 International workshop on structure and function of the lentiviral gp41 cytoplasmic tail. Viruses10:613. 10.3390/v10110613

  • 25

    GaoF.BailesE.RobertsonD. L.ChenY.RodenburgC. M.MichaelS. F.et al. (1999). Origin of HIV-1 in the chimpanzee Pan troglodytes troglodytes. Nature397, 436441. 10.1038/17130

  • 26

    GaoF.YueL.WhiteA. T.PappasP. G.BarchueJ.HansonA. P.et al. (1992). Human infection by genetically diverse SIV SM-related HIV-2 in West Africa. Nature358, 495499. 10.1038/358495a0

  • 27

    HemelaarJ.. (2012). The origin and diversity of the HIV-1 pandemic. Trends Mol. Med. 18, 182192. 10.1016/j.molmed.2011.12.001

  • 28

    HemelaarJ.ElangovanR.YunJ.Dickson-TettehL.FlemingerI.KirtleyS.et al. (2019). Global and regional molecular epidemiology of HIV-1, 1990–2015: a systematic review, global survey, and trend analysis. Lancet Infect. Dis. 19, 143155. 10.1016/S1473-3099(18)30647-9

  • 29

    HolguínA.De ArellanoE. R.SorianoV. (2007). Amino acid conservation in the gp41 transmembrane protein and natural polymorphisms associated with enfuvirtide resistance across HIV-1 variants. AIDS Res. Hum. Retroviruses23, 10671074. 10.1089/aid.2006.0256

  • 30

    HuangJ.KangB. H.PanceraM.LeeJ. H.TongT.FengY.et al. (2014). Broad and potent HIV-1 neutralization by a human antibody that binds the gp41-120 interface. Nature515, 138142. 10.1038/nature13601

  • 31

    HuangJ.OfekG.LaubL.LouderM. K.Doria-RoseN. A.LongoN. S.et al. (2012). Broad and potent neutralization of HIV-1 by a gp41-specific human antibody. Nature491, 406412. 10.1038/nature11544

  • 32

    JadhavS.TripathyS.KulkarniS.ChaturbhujD.GhareR.BhattacharyaJ.et al. (2011). Genetic conservation in gp36 transmembrane sequences of Indian HIV type 2 isolates. AIDS Res. Hum. Retroviruses27, 13371343. 10.1089/aid.2011.0063

  • 33

    KapoorA. K.PadivalS. (2021). HIV-2. In StatPearls. StatPearls Publishing. Available online at: http://www.ncbi.nlm.nih.gov/books/NBK572083/~[Internet].

  • 34

    KongR.LiH.Bibollet-RucheF.DeckerJ. M.ZhengN. N.GottliebG. S.et al. (2012). Broad and potent neutralizing antibody responses elicited in natural HIV-2 infection. J. Virol. 86, 947960. 10.1128/JVI.06155-11

  • 35

    KucharM.KosztyuP.Daniel LiškováV.CernýJ.PetrokováH.VróblováE.et al. (2021). Myomedin scaffold variants targeted to 10E8 HIV-1 broadly neutralizing antibody mimic gp41 epitope and elicit HIV-1 virus-neutralizing sera in mice. Virulence12, 12711287. 10.1080/21505594.2021.1920251

  • 36

    KwonY. D.GeorgievI. S.OfekG.ZhangB.AsokanM.BailerR. T.et al. (2016). Optimization of the solubility of HIV-1-neutralizing antibody 10E8 through somatic variation and structure-based design. J. Virol. 90, 58995914. 10.1128/JVI.03246-15

  • 37

    LazzarinA.. (2005). Enfuvirtide: the first HIV fusion inhibitor. Exp. Opin. Pharmacother.6, 453464. 10.1517/14656566.6.3.453

  • 38

    LeitnerT.KorberB.DanielsM.CalefC.FoleyB. (2005). HIV-1 subtype and circulating recombinant form (CRF) reference sequences, 2005. HIV Seq. Compend.2005, 4148. Available online at: https://www.hiv.lanl.gov/content/sequence/HIV/COMPENDIUM/2005/partI/leitner.pdf

  • 39

    LiK.XiuC. L.GaoL. M.ShiM.ZhaiY. (2016). Subtractive SELEX using agar beads for screening DNA aptamers with specific affinity to HIV gp41 antigen. Nan Fang Yi Ke Da Xue Xue Bao36, 15921598. 10.3969/j.issn.1673-4254.2016.12.01

  • 40

    Los Alamos HIV Molecular Immunology Database (2021a). Gp160 Ab Epitope Map. Available online at: https://www.hiv.lanl.gov/content/immunology/maps/ab/gp160.html (accessed September 15, 2021).

  • 41

    Los Alamos HIV Molecular Immunology Database (2021b). Search Antibody Database 10E8 MAb. Available online at: https://www.hiv.lanl.gov/content/immunology/ab_search.html?results=Searchandprotein=gp160andinrange=671-683andmab_name=10E8 (accessed February 27, 2022).

  • 42

    Los Alamos HIV Sequence Database (2021a). HIV Circulating Recombinant Forms (CRFs). Available online at: https://www.hiv.lanl.gov/content/sequence/HIV/CRFs/CRFs.html#CRF01 (accessed September 15, 2021).

  • 43

    Los Alamos HIV Sequence Database (2021b). Sequence Search Interface. Available online at: https://www.hiv.lanl.gov/components/sequence/HIV/search/search.html (accessed November 19, 2020).

  • 44

    LouisJ. M.BaberJ. L.GhirlandoR.AnianaA.BaxA.RocheJ. (2016). Insights into the conformation of the membrane proximal regions critical to the trimerization of the HIV-1 gp41 ectodomain bound to dodecyl phosphocholine micelles. PLoS ONE11:e0160597. 10.1371/journal.pone.0160597

  • 45

    LuW.ChenS.YuJ.BehrensR.WigginsJ.ShererN.et al. (2019). The polar region of the HIV-1 envelope protein determines viral fusion and infectivity by stabilizing the gp120-gp41 association. J. Virol. 93, e02128e02118. 10.1128/JVI.02128-18

  • 46

    LuqueF. J.CamarasaM. (2021). HIV-1 envelope spike MPER: from a vaccine target to a new druggable pocket for novel and effective fusion inhibitors. Chem. Med. Chem. 16, 105107. 10.1002/cmdc.202000411

  • 47

    MinkM.MosierS. M.JanumpalliS.DavisonD.JinL.MelbyT.et al. (2005). Impact of human immunodeficiency virus type 1 gp41 amino acid substitutions selected during enfuvirtide treatment on gp41 binding and antiviral potency of enfuvirtide in vitro. J. Virol. 79, 1244712454. 10.1128/JVI.79.19.12447-12454.2005

  • 48

    MourezT.SimonF.PlantierJ. C. (2013). Non-M variants of human immunodeficiency virus type 1. Clin. Microbiol. Rev. 26, 448461. 10.1128/CMR.00012-13

  • 49

    MurakamiT.FreedE. O. (2000). The long cytoplasmic tail of gp41 is required in a cell type-dependent manner for HIV-1 envelope glycoprotein incorporation into virions. Proc. Natl. Acad. Sci. U.S.A. 97, 343348. 10.1073/pnas.97.1.343

  • 50

    OldfieldV.KeatingG. M.PloskerG. (2005). Enfuvirtide: a review of its use in the management of HIV infection. Drugs65, 11391160. 10.2165/00003495-200565080-00007

  • 51

    Özkaya SahinG.HolmgrenB.da SilvaZ.NielsenJ.NowroozalizadehS.EsbjörnssonJ.et al. (2012). Potent intratype neutralizing activity distinguishes human immunodeficiency virus type 2 (HIV-2) from HIV-1. J. Virol. 86, 961971. 10.1128/JVI.06315-11

  • 52

    PanceraM.ZhouT.DruzA.GeorgievI. S.SotoC.GormanJ.et al. (2014). Structure and immune recognition of trimeric pre-fusion HIV-1 Env. Nature514, 455461. 10.1038/nature13808

  • 53

    Pérez-AlvarezL.CarmonaR.OcampoA.AsoreyA.MirallesC.Pérez de CastroS.et al. (2006). Long-term monitoring of genotypic and phenotypic resistance to T20 in treated patients infected with HIV-1. J. Med. Virol. 78, 141147. 10.1002/jmv.20520

  • 54

    PlantierJ.-C.LeozM.DickersonJ. E.De OliveiraF.CordonnierF.LeméeV.et al. (2009). A new human immunodeficiency virus derived from gorillas. Nat. Med. 15, 871872. 10.1038/nm.2016

  • 55

    QadirM. I.MalikS. A. (2010). HIV fusion inhibitors. Rev. Med. Virol. 20, 2333. 10.1002/rmv.631

  • 56

    RantalainenK.CottrellC. A. (2019). Full length HIV-1 Env AMC011 in complex with PGT151 Fab. PLoS Pathog. (2019) 15:e1007920. 10.2210/pdb6olp/pdb

  • 57

    ReyM. A.KrustB.LaurentA. G.GuétardD.MontagnierL.HovanessianA. G. (1989). Characterization of an HIV-2-related virus with a smaller sized extracellular envelope glycoprotein. Virology173, 258267. 10.1016/0042-6822(89)90242-0

  • 58

    RobertsonD. L.AndersonJ. P.BradacJ. A.CarrJ. K.FoleyB.FunkhouserR. K.et al. (2000). HIV-1 nomenclature proposal. Science288, 5555. 10.1126/science.288.5463.55d

  • 59

  • 60

    ScharfL.ScheidJ. F.LeeJ. H.WestA. P.ChenC.GaoH.et al. (2014). Antibody 8ANC195 reveals a site of broad vulnerability on the HIV-1 envelope spike. Cell Rep. 7, 785795. 10.1016/j.celrep.2014.04.001

  • 61

    SimonF.MauclèreP.RoquesP.Loussert-AjakaI.Müller-TrutwinM. C.SaragostiS.et al. (1998). Identification of a new human immunodeficiency virus type 1 distinct from group M and group O. Nat. Med. 4, 10321037. 10.1038/2017

  • 62

    SliepenK.HanB. W.BontjerI.MooijP.GarcesF.BehrensA.-J.et al. (2019). Structure and immunogenicity of a stabilized HIV-1 envelope trimer based on a group-M consensus sequence. Nat Commun. 10:2355. 10.1038/s41467-019-10262-5

  • 63

    SmithT.MasciotraS.LuoW.SullivanV.SwitzerW. M.JohnsonJ. A.et al. (2021). Broadly neutralizing HIV-1 antibody reactivity in HIV tests: implications for diagnostics. AIDS35, 15611565. 10.1097/QAD.0000000000002898

  • 64

    TamuraK.StecherG.PetersonD.FilipskiA.KumarS. (2013). MEGA6: molecular evolutionary genetics analysis version 6.0. Mol. Biol. Evol. 30, 27252729. 10.1093/molbev/mst197

  • 65

    Torrents de la PeñaA.RantalainenK.CottrellC. A.AllenJ. D.van GilsM. J.TorresJ. L.et al. (2019). Similarities and differences between native HIV-1 envelope glycoprotein trimers and stabilized soluble trimer mimetics. PLoS Pathog. 15:e1007920. 10.1371/journal.ppat.1007920

  • 66

    VisseauxB.DamondF.MatheronS.DescampsD.CharpentierC. (2016). HIV-2 molecular epidemiology. Infect. Genet. Evol. 46, 233240. 10.1016/j.meegid.2016.08.010

  • 67

    WangQ.ZhangL. (2020). Broadly neutralizing antibodies and vaccine design against HIV-1 infection. Front. Med. 14, 3042. 10.1007/s11684-019-0721-9

  • 68

    WensingA. M.CalvezV.Ceccherini-SilbersteinF.CharpentierC.GünthardH. F.ParedesR.et al. (2019). 2019 Update of the drug resistance mutations in HIV-1. Top. Antivir. Med. 27, 111–121. Available online at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6892618/pdf/tam-27-111.pdf

  • 69

    WymaD. J.KotovA.AikenC. (2000). Evidence for a stable interaction of gp41 with Pr55Gag in immature human immunodeficiency virus type 1 particles. J. Virol. 74, 93819387. 10.1128/JVI.74.20.9381-9387.2000

  • 70

    XuW.PuJ.SuS.HuaC.SuX.WangQ.et al. (2019). Revisiting the mechanism of enfuvirtide and designing an analog with improved fusion inhibitory activity by targeting triple sites in gp41. AIDS33, 15451555. 10.1097/QAD.0000000000002208

  • 71

    YamaguchiJ.VallariA.McArthurC.SthreshleyL.ClohertyG. A.BergM. G.et al. (2020). Brief report: complete genome sequence of CG-0018a-01 establishes HIV-1 subtype L. J. Acquir. Immune Defic. Syndr. 83, 319322. 10.1097/QAI.0000000000002246

  • 72

    YuanM.CottrellC. A.OzorowskiG.van GilsM. J.KumarS.WuN. C.et al. (2019). Conformational plasticity in the HIV-1 fusion peptide facilitates recognition by broadly neutralizing antibodies. Cell Host Microbe25, 873883. 10.1016/j.chom.2019.04.011

Summary

Keywords

HIV, transmembrane glycoprotein gp41, transmembrane glycoprotein gp36, conservation, variants, antibody binding domains

Citation

Valadés-Alcaraz A, Reinosa R and Holguín Á (2022) HIV Transmembrane Glycoprotein Conserved Domains and Genetic Markers Across HIV-1 and HIV-2 Variants. Front. Microbiol. 13:855232. doi: 10.3389/fmicb.2022.855232

Received

14 January 2022

Accepted

29 March 2022

Published

27 May 2022

Volume

13 - 2022

Edited by

Joris Hemelaar, University of Oxford, United Kingdom

Reviewed by

Kelly Lee, University of Washington, United States; Suvankar Ghorai, Raiganj University, India

Updates

Copyright

*Correspondence: África Holguín

This article was submitted to Virology, a section of the journal Frontiers in Microbiology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics