Abstract
Free full text
G→A Hypermutation in Protease and Reverse Transcriptase Regions of Human Immunodeficiency Virus Type 1 Residing in Resting CD4+ T Cells In Vivo
Abstract
In vitro studies have shown that the host cytidine deaminase APOBEC3G causes lethal hypermutation in human immunodeficiency virus type 1 reverse transcripts unless its incorporation into virions is blocked by Vif. By examining stably archived sequences in resting CD4+ T cells, we show that hypermutation occurs in most if not all infected individuals. Hypermutated sequences comprised >9% of archived species in resting CD4+ T cells but were not found in plasma virus. Mutations occurred in predicted contexts, with notable hotspots. Thus, defects in Vif function in vivo give rise to hypermutated viral genomes that can be integrated but do not produce progeny viruses.
A component of innate immune defense against viruses is ABOBEC3G, a cytidine deaminase which causes G→A hypermutation in retroviral genomes (9, 11, 16, 18, 26, 35, 36). ABOBEC3G is incorporated into assembling virions and then deaminates cytidines of the single-stranded viral cDNA that is initially synthesized by reverse transcriptase (RT) upon entry of the virus into a new host cell (11, 16, 18, 35, 36). This C→U deamination on the minus strand of the reverse transcript leads to fixation of G→A mutations. Pioneering work by Sheehy et al. identified APOBEC3G as an antiviral factor and showed that human immunodeficiency virus type 1 (HIV-1) Vif overcomes its effects (26). Vif counteracts APOBEC3G by inhibiting its translation and accelerating its degradation, thereby preventing APOBEC3G incorporation into HIV-1 virions (19, 20, 25, 27, 30). Studies of target sequence specificity of ABOBEC3G have revealed a context dependence for the two nucleotides immediately upstream of the targeted dC (1, 11, 35), consistent with reports of G→A hypermutation in HIV-1 sequences from infected individuals (3, 8, 13). In these studies, minus-strand C→U deamination resulted in fixation of G→A mutations within GA and GG dinucleotides with an extreme bias for GGG sequences (3, 13, 31). Within hypermutated sequences, 20 to 94% of guanine nucleotides in these contexts were mutated (13). Recent studies have delineated the preferences of ABOBEC3G and ABOBEC3F, a closely related protein of similar function (33, 34), as GG and GA, respectively (17, 33).
Most studies of the antiviral effects of APOBEC3G have utilized Vif− HIV-1 constructs, and there remains uncertainty about how often G→A hypermutation occurs in HIV-1-infected individuals and about the fate of hypermutated viruses. Janini et al. found hypermutation in 43% of patient samples (13). Hypermutated sequences had in-frame stop codons that would interfere with the production of viral proteins (13). However, the replication defect shown by Vif− viruses may actually operate at an early, postentry stage in viral replication and decrease the formation of proviruses (29, 32). One possibility is that the deoxyuridines produced by deamination undergo uracil excision by uracil-DNA glycosylases, exposing the viral cDNA to nucleases (11).
To understand the nature and distribution of hypermutated sequences in vivo, we analyzed the protease and RT regions of HIV-1 sequences obtained from resting CD4+ T cells of nine patients who had prolonged suppression of viremia to below the limit of detection on highly active antiretroviral therapy (HAART). In patients on successful long-term HAART, labile unintegrated forms of HIV-1 decay (2, 22), and resting CD4+ T cells harbor stably integrated, latent viral genomes, some of which are replication competent (4, 5, 10). This cellular reservoir persists in patients on HAART (6, 21, 28) and continually releases virus into the plasma at low levels (12, 14). We analyzed hypermutation in both the cellular and plasma compartments of these patients.
Resting CD4+ T cells were isolated from peripheral blood mononuclear cells by use of negative selection to remove monocytes, natural killer (NK) cells, B cells, CD8+ T cells, and activated CD4+ T cells as previously described (7). The resulting populations were >90% pure. pol gene sequences from resting cells were obtained by a single genome sequencing method (P. Kwon, M. Wind-Rotolo, and R. F. Siliciano, unpublished data). Plasma sequences were obtained by frequent sampling over a 3-month period and an ultrasensitive RT-PCR capable of separately genotyping RT and protease from patients with viral loads below 50 copies/ml (14). Sequences were analyzed using a program (www.hiv.lanl.gov/HYPERMUT/hypermut.html) that compares each patient sequence to a patient-specific consensus to determine the frequency and context of G→A mutations (23). Hypermutated sequences were defined as having >5% of the total Gs mutated to A but <1% A→G mutations.
Of 2,024 independent RT and protease sequences from the plasma virus of nine patients, not a single one was hypermutated. In contrast, from a total of 317 independent pol clones from the latent cellular reservoir, hypermutation was detected in 19 of 302 (6.3%) protease sequences and 21 of 309 (6.8%) RT sequences. Both protease and RT were hypermutated in 12 clones. Thus, there were a minimum of 28 (8.8%) hypermutated genomes among the 317 pol clones. This is a minimal estimate, because sequencing repeatedly failed for some pol clones (15 protease and 8 RT sequences), possibly due to hypermutation, and because sequencing additional regions of the viral genome, particularly env and the 5′ half of nef (35), may have revealed additional hypermutation. At least one hypermutated sequence was found with each patient studied. Phylogenetic analysis showed patient-specific clustering and extreme divergence of the hypermutated sequences within patient-specific clusters (Fig. (Fig.1).1). The routine detection of hypermutated sequences in resting CD4+ T cells of patients with prolonged suppression of active viral replication on HAART suggests that hypermutated viral genomes can enter the stable pool of integrated HIV-1 DNA in these cells. The absence of hypermutated plasma sequences indicates that hypermutation blocks virus production from these proviruses.
Within hypermutated sequences, G→A mutations were found at an average of 11% (RT) and 13.5% (protease) of all Gs (range, 5 to 22%), while only 0.18% (RT) and 0.44% (protease) of As were mutated to guanine (Table (Table1).1). In 98% of the cases, mutations were found in either GA or GG dinucleotides, with about 20% in GA and 80% in either GG or GGG sequences. In contrast, Janini et al. (13) noted a 3F-like preference for mutation within the GA context in protease sequences from unfractionated peripheral blood mononuclear cells of viremic patients. We found a preference for hypermutation at Gs preceded by T, consistent with a recent study defining TGGG as the tetranucleotide consensus sequence for ABOBEC3G (35). Our results are consistent with the idea of both ABOBEC3G and ABOBEC3F contributing to in vivo hypermutation.
TABLE 1.
Region | Length (no. of nucleotides) | Total no. of G → A mutations | % Of Gs mutated | Total no. of A → G mutations | % Of As mutated | % Of mutationsa
| ||||
---|---|---|---|---|---|---|---|---|---|---|
GG | GA | GT | GC | GGG | ||||||
RT (n = 21) | 710 | 332 | 11 | 11 | 0.18 | 40 | 18 | 1 | 1 | 40 |
Protease (n = 19) | 297 | 165 | 13.5 | 9 | 0.44 | 59 | 17 | 0 | 2 | 22 |
Mutational hotspots, defined as G→A mutations in >50% of hypermutated sequences, were identified in the protease and RT genes (Table (Table22 and Table Table3).3). The substitution at nucleotide (nt) 125 in protease was found in 63% of sequences and resulted in a stop codon. Four other hotspots resulted in a change at a glycine residue. This is as expected, since glycine codons (GGN) contain the targeted GG dinucleotide. In RT, nine hotspots were identified (Table (Table3).3). The Gs at nt 151, 263, 333, and 690 occurred at the beginning of GGG trinucleotides and were mutated in 80 to 95% of sequences. RT mutations resulted in substitutions at methionine (n = 4), tryptophan (n = 2), or glycine (n = 2). The mutations in methionine codons (ATG) produced changes to isoleucine (ATA), one of which was at amino acid 184 and corresponded to a known intermediate in the pathway to lamivudine resistance. The mutations within tryptophan codons resulted in stop codons at amino acids 88 and 212. The mutation at nt 333 produced a synonymous change detected in 95% of the sequences.
TABLE 2.
nt positiona | 42 | 46 | 47 | 48 | 49 | 51 | 60 | 73 | 88 | 100 | 108 | 114 | 118 | 122 | 125 | 126 | 142 | 143 | 144 | 145 | 146 | 151 | 152 | 154 | 178 | 183 | 193 | 202 | 203 | 217 | 232 | 233 | 256 | 257 | 260 | 270 | 280 | 281 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
G → A contextb | GAN | GGG | GGG | GGG | GGG | GCN | GGN | GAN | GAN | GAN | GAN | GCN | GGN | GAN | GGN | GAN | GGG | GGG | GGG | GGN | GAN | GGN | GAN | GGN | GAN | GGN | GAN | GGN | GAN | GGN | GGN | GAN | GGN | GAN | GAN | GAN | GGN | GGN | |
99.E15 | A | A | A | ||||||||||||||||||||||||||||||||||||
99.E19 | A | A | A | A | |||||||||||||||||||||||||||||||||||
99.E36 | A | A | A | A | A | A | A | ||||||||||||||||||||||||||||||||
113.G2 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | ||||||||||||||||||||
113.I6 | A | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||||||
113.I16 | A | A | A | A | A | A | |||||||||||||||||||||||||||||||||
113.I49 | A | A | A | A | A | A | A | ||||||||||||||||||||||||||||||||
113.I105 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | ||||||||||||||||||||
134.D1 | A | A | A | A | A | A | A | ||||||||||||||||||||||||||||||||
135.F36 | A | A | A | A | A | ||||||||||||||||||||||||||||||||||
135.F39 | A | A | A | A | A | ||||||||||||||||||||||||||||||||||
136.G2 | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||||||||
139.F1 | A | A | A | A | A | A | |||||||||||||||||||||||||||||||||
139.H1 | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||||||||
147.E11 | A | A | A | A | A | A | A | ||||||||||||||||||||||||||||||||
147.E22 | A | A | A | A | A | A | A | ||||||||||||||||||||||||||||||||
154.F7 | A | A | A | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||||
154.F23 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||
154.G17 | A | A | A | A | A | A | A | A | A | A | A | ||||||||||||||||||||||||||||
% Of sequences with mutation | 5 | 16 | 16 | 21 | 11 | 5 | 32 | 11 | 11 | 21 | 32 | 5 | 5 | 5 | 63* | 11 | 37 | 42 | 53 | 26 | 11 | 53 | 5 | 47 | 11 | 11 | 16 | 32 | 5 | 42 | 26 | 5 | 74 | 16 | 16 | 5 | 68 | 5 | |
Amino acid positionc | 42 | 48 | 52 | 86 | 94 | ||||||||||||||||||||||||||||||||||
Substitutiond | W-* | G-K | G-S | G-R | G-S |
TABLE 3.
nt positiona
| 48
| 52
| 85
| 123
| 124
| 130
| 133
| 134
| 151
| 157
| 212
| 213
| 215
| 233
| 246
| 263
| 264
| 277
| 295
| 296
| 327
| 333
| 367
| 374
| 375
| 421
| 422
| 453
| 454
| 458
| 463
| 515
| 552
| 568
| 586
| 609
| 615
| 635
| 636
| 637
| 638
| 686
| 690
| 691
|
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
G → A contextb | GGN | GG | GAN | GGN | GAN | GAN | GGG | GGN | GGG | GAN | GGN | GAN | GAN | GAN | GAN | GGG | GGN | GGN | GGG | GGN | GGN | GGG | GAN | GGN | GAN | GGG | GGN | GGG | GGN | GGN | GGN | GAN | GGN | GGN | GGG | GGN | GAN | GGG | GGG | GGG | GGN | GGN | GGG | GGN |
99.E15 | A | A | A | A | A | A | A | A | A | A | A | A | ||||||||||||||||||||||||||||||||
113.I6 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | ||||||||||||||||||
113.I16 | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||||||||||||
113.I49 | A | A | A | A | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||||||||
113.I96 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||
134.D14 | A | A | A | A | A | A | A | A | A | A | A | A | ||||||||||||||||||||||||||||||||
134.D26 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | ||||||||||||||||||||||||||
135.F35 | A | A | A | A | A | A | A | A | A | A | ||||||||||||||||||||||||||||||||||
135.F36 | A | A | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||||||||||
135.F39 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | ||||||||||||||||||||||||||
136.G2 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||
139.F1 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | ||||||||||||||||||||||||||||||
139.F21 | A | A | A | A | A | A | A | A | ||||||||||||||||||||||||||||||||||||
139.H1 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||||
147.E8 | A | A | A | A | ||||||||||||||||||||||||||||||||||||||||
147.E11 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||
147.E13 | A | A | A | A | A | A | A | |||||||||||||||||||||||||||||||||||||
147.E22 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | ||||||||||||||||||||||||||||||
148.G4 | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||||||||||||
154.F12 | A | A | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||||||||||||
154.F7 | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | |||||||||||||||||||||||
% Of sequences with mutation | 52 | 24 | 14 | 57 | 10 | 19 | 57 | 10 | 90 | 14 | 24 | 10 | 19 | 14 | 14 | 81 | 24 | 38 | 19 | 14 | 33 | 95 | 14 | 10 | 14 | 19 | 24 | 38 | 19 | 43 | 29 | 19 | 71 | 24 | 19 | 14 | 14 | 43 | 57 | 38 | 10 | 38 | 86 | 19 |
Amino acid positionc | 16 | 41 | 45 | 51 | 88 | 111 | 184 | 212 | 230 | |||||||||||||||||||||||||||||||||||
Substitutiond | M-I | M-I | G-R/K | G-R | W-* | None | M-I | W-* | M-I |
We have detected hypermutated sequences in resting CD4+ T cells from patients who had prolonged suppression of viremia on HAART. Hypermutated sequences with characteristic APOBEC3G/F-mediated changes constituted >9% of the viral genomes in this compartment and were found in every patient. This level may reflect some degree of accumulation of defective genomes that do not direct the synthesis of a full complement of viral proteins, thereby sparing the infected cell from viral cytopathic effects. Our data suggest that mutated viral genomes are able to integrate into cellular DNA and persist in resting CD4+ T cells even when viral replication is halted with HAART. However, these hypermutated genomes are subsequently incapable of producing virus, as indicated by the fact that the viruses released into the plasma at low level in patients on HAART were completely devoid of hypermutated sequences.
Acknowledgments
This work was supported by the Johns Hopkins University School of Medicine General Clinical Research Center, grant number M01-RR00052, from the National Center for Research Resource/National Institutes of Health and by National Institutes of Health grants AI43222 and AI51178 and a grant from the Doris Duke Charitable Foundation.
REFERENCES
Articles from Journal of Virology are provided here courtesy of American Society for Microbiology (ASM)
Full text links
Read article at publisher's site: https://doi.org/10.1128/jvi.79.3.1975-1980.2005
Read article for free, from open access legal sources, via Unpaywall: https://jvi.asm.org/content/jvi/79/3/1975.full.pdf
Citations & impact
Impact metrics
Article citations
Consistency of drug-resistant mutations in plasma and peripheral blood mononuclear cells of patients with treatment-naïve and treatment-experienced HIV-1 infection.
Front Cell Infect Microbiol, 13:1249837, 19 Dec 2023
Cited by: 0 articles | PMID: 38179423 | PMCID: PMC10766352
The effect of random virus failure following cell entry on infection outcome and the success of antiviral therapy.
Sci Rep, 13(1):17243, 11 Oct 2023
Cited by: 0 articles | PMID: 37821517 | PMCID: PMC10567758
Potent dual block to HIV-1 infection using lentiviral vectors expressing fusion inhibitor peptide mC46- and Vif-resistant APOBEC3G.
Mol Ther Nucleic Acids, 33:794-809, 11 Aug 2023
Cited by: 0 articles | PMID: 37662965 | PMCID: PMC10470399
Role of Proviral HIV-1 DNA Genotyping for People Living with HIV (PLWH) Who Had Low-Level Viremia While Receiving Antiretroviral Therapy.
Infect Drug Resist, 16:4697-4706, 19 Jul 2023
Cited by: 0 articles | PMID: 37489173 | PMCID: PMC10363348
Differential Activity of APOBEC3F, APOBEC3G, and APOBEC3H in the Restriction of HIV-2.
J Mol Biol, 434(2):167355, 10 Nov 2021
Cited by: 4 articles | PMID: 34774569 | PMCID: PMC8752514
Go to all (123) article citations
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
Persistence of wild-type virus and lack of temporal structure in the latent reservoir for human immunodeficiency virus type 1 in pediatric patients with extensive antiretroviral exposure.
J Virol, 76(18):9481-9492, 01 Sep 2002
Cited by: 96 articles | PMID: 12186930 | PMCID: PMC136462
Human immunodeficiency virus type 1 DNA sequences genetically damaged by hypermutation are often abundant in patient peripheral blood mononuclear cells and may be generated during near-simultaneous infection and activation of CD4(+) T cells.
J Virol, 75(17):7973-7986, 01 Sep 2001
Cited by: 136 articles | PMID: 11483742 | PMCID: PMC115041
Cellular APOBEC3G restricts HIV-1 infection in resting CD4+ T cells.
Nature, 435(7038):108-114, 13 Apr 2005
Cited by: 324 articles | PMID: 15829920
Insertions in the human immunodeficiency virus type 1 protease and reverse transcriptase genes: clinical impact and molecular mechanisms.
Antimicrob Agents Chemother, 49(7):2575-2582, 01 Jul 2005
Cited by: 25 articles | PMID: 15980322 | PMCID: PMC1168704
Review Free full text in Europe PMC
Funding
Funders who supported this work.
NCRR NIH HHS (2)
Grant ID: M01-RR00052
Grant ID: M01 RR000052
NIAID NIH HHS (5)
Grant ID: R37 AI051178
Grant ID: AI51178
Grant ID: R01 AI051178
Grant ID: AI43222
Grant ID: R01 AI043222