Hemagglutinin Structure and Activities
- Steven J. Gamblin,
- Sébastien G. Vachieri,
- Xiaoli Xiong,
- Jie Zhang,
- Stephen R. Martin and
- John J. Skehel
- Structural Biology of Disease Processes Laboratory, The Francis Crick Institute, London NW1 1AT, United Kingdom
- Correspondence: John.Skehel{at}crick.ac.uk
Abstract
Hemagglutinins (HAs) are the receptor-binding and membrane fusion glycoproteins of influenza viruses. They recognize sialic acid–containing, cell-surface glycoconjugates as receptors but have limited affinity for them, and, as a consequence, virus attachment to cells requires their interaction with several virus HAs. Receptor-bound virus is transferred into endosomes where membrane fusion by HAs is activated at pH between 5 and 6.5, depending on the strain of virus. Fusion activity requires extensive rearrangements in HA conformation that include extrusion of a buried “fusion peptide” to connect with the endosomal membrane, form a bridge to the virus membrane, and eventually bring both membranes close together. In this review, we give an overview of the structures of the 16 genetically and antigenically distinct subtypes of influenza A HA in relation to these two functions in virus replication and in relation to recognition of HA by antibodies that neutralize infection.
HEMAGGLUTININ STRUCTURE
Hemagglutinins (HAs) project from the virus surface membrane, as 30–50 Å-diameter, 135 Å-long glycoprotein spikes (Wasilewski et al. 2012). Each spike contains three identical subunits formed of two glycopolypeptides, HA1 and HA2. The subunits are divided into four subdomains: the membrane-distal receptor-binding subdomain and the inactive, vestigial esterase subdomain of HA1; and the membrane-proximal fusion subdomain, which contains parts of HA1 and HA2, and the membrane anchor subdomain formed by membrane-associated residues of HA2 (Fig. 1).
The receptor-binding subdomain R contains the receptor-binding site, which is a shallow depression at the membrane-distal tip of the molecule (Wilson et al. 1981; Rogers et al. 1983; Weis et al. 1988). Its base contains four conserved residues—tyrosine-98, histidine-183, tyrosine-195, and tryptophan-153—linked by hydrogen bonds (see Fig. 2). The site is enclosed by the 130-loop, the 150-loop, the 220-loop, and the 190-helix. Antigenic sites to which antibodies bind to block infectivity either directly or following Fc-receptor recognition, are clustered in and around the receptor-binding site (Knossow and Skehel 2006; Whittle et al. 2011; Xiong et al. 2015; McCarthy et al. 2018). Amino acid substitutions selected by such antibodies can influence both antibody binding and receptor binding. Those that result in the introduction of additional carbohydrate side chains can have particularly important consequences for both activities (Marinina et al. 2003; Lin et al. 2012; Yang et al. 2015).
THE VESTIGIAL ESTERASE SUBDOMAIN, E
The vestigial esterase subdomain E is named for its similarity to part of the 9-O-acetylase of the hemagglutinin-esterase-fusion glycoprotein of influenza C virus, which is the evolutionary precursor of influenza A (Rosenthal et al. 1998). In influenza C, the esterase functions as a receptor-destroying enzyme, analogous to the sialic acid–receptor-destroying enzymes, the neuraminidases, of influenza A and B viruses (Colman 1994). The E subdomain, which has no esterase activity, links the R subdomain to the HA1 component of the fusion subdomain (Fig. 1). Structurally, several loops from E interact with the interhelical loop in the fusion subdomain (Fig. 1E).
THE FUSION SUBDOMAIN, F
The amino terminus of HA1 located in the F subdomain is the membrane-proximal component of the HA ectodomain, formed by the removal of the biosynthetic signal sequence during virus replication (McCauley et al. 1979; Porter et al. 1979). Amino-terminal residues of HA1 form one strand of a five-stranded, membrane-proximal β-sheet in each subunit of the trimer, following which the 30-loop inserts between and packs against the central α-helices (B-helices) of the subdomain. HA1 then extends toward the membrane-distal part of the molecule, antiparallel with its carboxy-terminal residues (Fig. 1E). The HA1 chain forms the E and the R subdomains before it returns to F and terminates at the site of proteolytic cleavage of HA0, the biosynthetic precursor of HA1 and HA2 (Fig. 1E; Chen et al. 1998; Böttcher-Friebartshäuser et al. 2013). Cleavage is essential for membrane fusion activity and virus infectivity (Klenk et al. 1975; Lazarowitz and Choppin 1975).
The major components of the F subdomain are contributed by HA2 (Fig. 1D). The hydrophobic amino-terminal region of HA2 is buried in the trimer interface, ∼30 Å from the virus membrane (Wilson et al. 1981; Chen et al. 1998). The 23 amino-terminal residues are called the fusion peptide because of their assumed direct role in membrane fusion. They are linked to one of two antiparallel strands of a membrane-proximal, five-stranded β-sheet that is joined to α-helix A. The α-helix A is the shorter helix of a central α-helical hairpin-like structure that is the most prominent structural feature of F. The hairpin itself is formed by a 17-residue-long extended chain, called the interhelical loop, that links the carboxyl terminus of the shorter α-helix A, to the amino terminus of the longer α-helix B. In their amino-terminal, membrane-distal regions, the B helices form an eight-turn, trimeric coiled-coil that opens, as the α-helices extend back toward the membrane-associated region into a tripod-like structure (Fig. 1D). The fusion peptide inserts between the three helices of the tripod ∼20 Å beneath the carboxyl terminus of the coiled-coil.
Attached to the carboxyl termini of the B helices, two antiparallel β-strands complete the five-stranded membrane-proximal β-sheet. The sheet is linked to a short α-helix that joins the 160-helix that is oriented almost parallel to the virus membrane. The 160-helix links the ectodomain to the membrane anchor subdomain through an eight-residue flexible linker between two glycine residues (Benton et al. 2018).
THE MEMBRANE ANCHOR SUBDOMAIN, M
The major component of M is a triple-helical structure, residues 185–204 of each subunit of the trimer. The helices splay apart at an angle of ∼60° at the carboxyl terminus of the coil (Fig. 1D; Benton et al. 2018). The structures of the six remaining residues of M and the 11 carboxy-terminal residues of HA2 that are outside the virus membrane have not been determined.
STRUCTURAL COMPARISONS
Most of the information on the structures of HAs have been obtained by X-ray crystallography using crystals formed of soluble ectodomains, obtained by proteolytic removal of the membrane anchor subdomains (Brand and Skehel 1972) (used, e.g., in Wilson et al. 1981; Ha et al. 2001; Gamblin et al. 2004; Stevens et al. 2004; Liu et al. 2009) or by expression of truncated cDNA (e.g., Chen et al. 1995, in bacterial cells; Chen et al. 1998, in mammalian cells; and Xiong et al. 2013a,b, in insect cells). The structure of the complete HA was determined by cryo-electron microscopy (Benton et al. 2018). In addition to revealing the structure of the M subdomain, this complete HA structure indicates that the isolated ectodomains retain their three-dimensional structures independently of the membrane anchor, as concluded before from spectroscopic analyses (Flanagan and Skehel 1977).
GROUP-SPECIFIC STRUCTURAL FEATURES
Antigenically (WHO 1980), genetically (Air 1981), and structurally (Ha et al. 2002; Russell et al. 2004) the 16 HA subtypes form two groups with three clades in Group 1 and two in Group 2 (Fig. 3A). Structurally, HAs of all 16 subtypes are very similar (e.g., Ha et al. 2002; Russell et al. 2004; Liu et al. 2009; Lu et al. 2013; Vachieri et al. 2014; Tzarum et al. 2015, 2017; Song et al. 2017) and the structures of the four individual subdomains are, for the most part, conserved (Fig. 3B). However, differences in the orientations of the membrane-distal R and E subdomains relative to the membrane-proximal F subdomain are among the most prominent group-specific structural differences between HAs (Fig. 3; Table 1; Ha et al. 2001). They seem to derive primarily from differences in the structure of the interhelical loops of the F subdomain connecting the carboxyl terminus of helix A and the amino terminus of helix B, and from interactions between the loops and the E and F subdomains of the same and neighboring subunits.
The main elements of structural distinction shared by group members are in the F subdomain, near the carboxyl terminus of α-helix A and near the amino terminus of α-helix B.
At the carboxy-terminal region of α-helix A in Group 1 HAs, HA2 methionine-59 makes hydrophobic interactions with conserved HA1 phenylalanine-294 and forms the carboxyl terminus of α-helix A. The penultimate residue of helix A is conserved lysine-58, which forms a salt bridge with conserved HA2 glutamic acid-97 that helps position the amino-terminal region of the interhelical loop close to α-helix B (Fig. 3A).
By comparison, in Group 2 HAs, HA2 threonine-59 is ∼7 Å distant from HA1 phenylalanine-294 (Fig. 3A).
These differences between Group 1 and Group 2 HAs at the carboxyl terminus of α-helix A correlate with α-helix A being one turn longer in Group 1 HAs than in Group 2 and with the interhelical loop being shorter in Group 1 HAs.
At the amino terminus of α-helix B in Group1 HAs, conserved glutamic acid-74 forms a salt bridge with Arginine-76 of a neighboring α-helix B that cross-links the subunits of the trimer. The tall turns at the carboxyl terminus of the interhelical loop are stabilized by hydrogen bonds between group-specific HA1 glutamic acid-107 (aspartic acid in H8) and the peptide amides of residues 75 and 76. By contrast, in Group 2 HAs the turn is tighter arising, intriguingly, from clade-specific features.
In clade H3,4,14, of Group 2, the amino terminus of α-helix B is glycine-75. Because glycine residues can adopt stereochemical conformations not favored for other amino acids, glycine-75 facilitates the formation of sharp turns between the carboxyl termini of the interhelical loops and the amino termini of the B α-helices (Fig. 3B). Salt bridges formed between HA2 glutamic acid-74 and arginine-76 of neighboring subunits, as in Group 1 HAs, together with the additional salt bridge formed between arginine-76 and clade-specific glutamic acid-81, cross-link the carboxy-terminal region of the loops in all three subunits of the trimer and stabilize this conformation.
A number of interactions involving H3,4,14 clade-specific charged residues also appear to stabilize the carboxy-terminal region of the interhelical loop by linking it to residues in the E subdomain.
In clade H7,10,15, the residue at position HA2 75 is not glycine and HA2 76 is not arginine. Nevertheless, a Group 2–specific sharp turn is formed at the amino terminus of α-helix B that correlates in this case, with a combination of hydrophobic interactions between nonpolar residues in the interhelical loop and α-helix B, and with the aliphatic parts of the side chains of a number of charged HA1 residues. In particular, the sharp turn appears to be secured by conserved phenylalanine HA2 70 of the interhelical loop docking into a cavity edged by these HA1-residue side chains and also by a network of salt bridges between HA1 and HA2 (see Fig. 4).
GROUP-SPECIFIC ROTATION OF R AND E SUBDOMAINS RELATIVE TO THE F SUBDOMAIN
Differences in the conformation of the interhelical loops in HAs of different clades and in interactions made by them are major influences on the different degrees of rotation of the membrane-distal subdomains of the different clades in relation to helices A and B.
In the amino-terminal region of the interhelical loops, salt bridges and hydrogen bonds formed between loop residues of Group 2 HAs and helix B, together with hydrophobic interactions, are involved in positioning the loops close to helix B (Fig. 3). In addition, the contribution of a strand to the membrane-distal β-sheet of subdomain F positions the β-sheet of HA1 close to helix B.
As a consequence, by comparison with Group 1 HAs, the R and E subdomains of Group 2 HAs appear to be rotated around the threefold axis of symmetry (Fig. 3; Table 1). In addition, the positioning of the F subdomain influences the positions adopted by the Group 2 E and R subdomains that are closer to the carboxyl terminus of the interhelical loop and the amino terminus of α-helix B (Fig. 3B,C). The tall turns of group1 HAs by contrast, are stabilized by interactions between group-specific charged residues that preclude the HA1 110-helix from proximity to α-helix B. Conserved serine-107 in the H3,4,14, clade and alanine-107 in the H7,10,15 clade, on the other hand, facilitate the positioning of the 110-α-helix toward the sharp turn at the amino terminus of α-helix B (Fig. 5B).
RECEPTOR-BINDING AFFINITY AND SPECIFICITY
Binding of viruses to cells to be infected and release of viruses from infected cells involves sialic acid recognition by both glycoproteins of the virus membrane, HA and neuraminidase, respectively. Their specificities and activities are required to be balanced for effective virus infection (Kaverin et al. 1998; Mitnaul et al. 2000; Baigent and McCauley 2001; Wagner et al. 2002).
The antigenic properties of both glycoproteins vary during a pandemic period and their activities can also vary in recognition specificity and specific activity. Because inhibition-based assays of their activities are the standard procedures used to characterize viruses isolated during antigenic drift, information on changes in the activities is essential for correct interpretation of the assay results. In addition, this information is required to identify differences in the activities of the glycoproteins that appear on cross-species transfer of viruses, a process that can be involved in the start of a new pandemic.
The attachment of virus to the surface of a cell is a polyvalent interaction between hemagglutinins on the virus and multiple copies of sialic acid, which is the terminal sugar of many carbohydrate side chains. Numerous studies (Rogers and D'Souza 1989; Connor et al. 1994; Gambaryan et al. 1997; Matrosovich et al. 1997; Imai et al. 2012) have shown that sialic acid–receptor binding is species-specific: avian and equine viruses prefer to bind to sialic acid in α-2,3 linkage to galactose, and human viruses prefer α-2,6-linked sialic acid. These preferences correlate with observations that there is an abundance of α-2,6-linked sialic acids in the upper respiratory tract of humans and of α-2,3-linked sialic acids in the intestinal mucosa of birds, the respective primary sites of infection, and with different preferences for cells in the respiratory tract (Ito et al. 1998; Matrosovich et al. 2004; Shinya et al. 2006; van Riel et al. 2007).
Although other factors may be important, the avidity of a virus for a cell surface coated with sialic acid receptors is directly related to the affinity of individual HA binding sites for the particular sialic acid receptor. It is known that binding of receptor analogs to individual HAs is relatively weak; nuclear magnetic resonance (NMR) measurements (Sauter et al. 1989; Hanson et al. 1992) give equilibrium dissociation constants (Kd) for the HA of a human H3 virus of ∼2 mM for α-2,6 sialyllactose (human receptor) and 3 mM for α-2,3 sialyllactose (avian receptor). For mutant HAs with a Leu-226→Gln substitution in the 220-loop of the receptor-binding site (Fig. 2) that was shown to influence binding specificity (Rogers et al. 1983, 1985) and also to have been observed early in the H2 and H3 pandemics (Matrosovich et al. 2000), the values were ∼6 mM for human and 3 mM for avian receptors. The Leu-226→Gln mutation, expected to make the HA more avian-like, has only a small effect in changing a 1.5-fold preference for human receptors to a twofold preference for avian receptors. Similar results have been obtained using microscale thermophoresis (MST) with these and other HAs (Benton et al. 2015). For example, Kds have been determined for an avian H5 HA of ∼20 mM for human receptor and 1 mM for avian receptor (Xiong et al. 2013a).
The weak interactions, consistently detected by such methods, could give rise to very tight binding of viruses as a result of polyvalency. However, given the levels of discrimination frequently observed between human and avian receptors and the hundreds of HAs on the surface of a virus, it is unclear how species specificity would be achieved. Biolayer interferometry has been used to address this question using a range of virus concentrations and varying the amounts of receptor analog on the sensor.
Plots of the transformed data (Benton et al. 2015) are used to calculate an apparent avidity for the virus Kd(virus) at different loadings of receptor analog.
Treating virus binding as an example of a polyvalent interaction, it can be assumed that virus avidity is related to the affinity of a receptor analog for an HA in the following way: where Kd(receptor) is the equilibrium dissociation constant for the binding of a receptor analog to HA and the term mc is a multiplicity coefficient that represents an average for the number and strength of individual HAs interacting with sialic acid. Using Kd(virus) values calculated at a number of different receptor analog loadings and Kd(receptor) values derived from NMR and MST measurements, mc reaches a maximum value of 5.4 at saturating receptor analog loading. This value indicates an upper limit for the number and strength of HA interactions that can be made with a receptor analog–loaded surface and is ultimately related to the size of the footprint that the virus can make. It does not mean that only five HA–sialic acid interactions are made but that multiple successive interactions become less effective in stabilizing binding—a common observation with polyvalent interactions.
The same procedures can be used to estimate the preferences of a given virus for different receptors. For example, (1) the data for individual human H3 HAs suggest virus binding values of 3.47 fM for human receptor and 917 fM for avian receptor, a difference in affinity of 265-fold (Benton et al. 2015), and (2) for an H5 virus binding to a saturated surface where mc = 5.4, the calculated Kd(virus) values would be 0.87 nM and 0.1 fM for human and avian receptors, respectively, a difference in affinity of ∼8 millionfold. Consistent with these values, binding of the H5 virus to surfaces coated with human receptor is only just detectable at very high receptor analog loadings (Xiong et al. 2013a).
For interspecies transmission, as noted above, it is generally the case that avian viruses do not readily infect humans. This is thought to be due at least in part to the fact that avian viruses bind with low affinity to sialic acids of the human receptors found in the upper respiratory tract of humans. This is consistent with the above analyses showing that avian H5 virus binds very weakly to a sensor surface fully saturated with human receptor but very strongly to a surface coated with avian receptor.
However, from transmission experiments that used an avian H5 virus to infect ferrets as surrogate humans (Imai et al. 2012), MST data for the HA of a ferret-transmissible mutant H5 virus that was isolated show that its affinity for human receptors is only slightly increased (Xiong et al. 2013a). The Kd(receptor) of the transmissible mutant HA for the human receptor is 13 mM compared with 21 mM for the wild-type (wt) avian H5 HA, a 1.6-fold change in Kd(receptor) that would result in a 13-fold difference in Kd(virus) for binding to human receptor. However, the affinity for avian receptors was more significantly reduced to Kd(receptor) 32 mM for the mutant, compared to 1.1 mM for the wt HA. Would the modest increase in affinity, if replicated by the virus interaction with a human cell surface, be sufficient to explain the transmissibility of this mutant virus to humans? Or is the more dramatic reduction in the affinity for avian receptors more likely to be involved, by allowing the virus to evade capture by α-2,3-linked sialic acids on respiratory secretions (Couceiro et al. 1993)? Perhaps a combination of both effects is required.
Quantitative studies of receptor binding of this sort have been accompanied by extensive analysis of specificity using array technologies that display large numbers of potential sialoside receptors derived from biological sources or produced synthetically (Blixt et al. 2004; Feizi and Chai 2004; Gao et al. 2019). These arrays have been particularly useful recently in analyzing the receptor-binding properties of viruses isolated in different stages of a pandemic, at which times, receptor-binding variants are initially manifested by decreases in the affinities of viruses for erythrocytes of different species, in hemagglutination tests (Lin et al. 2012). They have, for example, shown the preference of H1- and H3-subtype human viruses for branched glycans with branches consisting of sialylated poly-N-acetyl-lactosamine repeats that might interact at more than one binding site on a hemagglutinin simultaneously (Peng et al. 2017; Byrd-Leotis et al. 2019).
Microarrays have also been used to display and identify potential ligands extracted from human, ferret, and porcine lung tissues (Walther et al. 2013; Byrd-Leotis et al. 2014, 2019). With these preparations it has again been seen that viruses from different species, or with differing antigenic properties, have different binding specificities. In addition, using such arrays it is reported that viruses also bound to nonsialylated, phosphorylated glycans (Byrd-Leotis et al. 2019). Both areas of research, whether or not the receptors identified as branched chain sialosides or as phosphorylated glycans are physiologically involved, provide an important focus for research on the nature of the receptors involved in influenza virus–cell interactions in vivo.
Additional information on recognition specificity and its molecular basis has been obtained from X-ray crystallographic and NMR analyses, and from molecular dynamics simulations of HA-sialoside complexes (Weis et al. 1988; Sauter et al. 1992; Eisen et al. 1997; Ha et al. 2001, 2003; Russell et al. 2006; Lin et al. 2012; Xu et al. 2012; Xiong et al. 2013a,b; Collins et al. 2014; Macchi et al. 2016). These studies indicate that the sialic acid moiety of the receptor interacts through hydrogen bonds between its carboxylate and hydroxylated amino acids at residues 136 and 137 of the 130-loop, between the peptide carbonyl of residue 135 and the amide of the N-acetamido substituent, between the glutamate or aspartate at residue 190 of the 190-helix and the 9-hydroxyl group of the glycerol substituent, and between the hydroxyl group of tyrosine-98 and the 8-hydroxyl of the glycerol substituent (Fig. 2). The importance of these interactions has also been studied by analyses of site-specific mutant HAs (Martín et al. 1998; Meisner et al. 2008). Although sialic acid occupies the binding site of these complexes in very similar ways independently of the nature of the other sugar components of the receptor analogs, and essentially the same for all viruses, the positions of the other saccharides of the bound sialosides show greater positional variation (Ha et al. 2001; Gamblin et al. 2004; Russell et al. 2006; Xiong et al. 2014). Specifically, α-2,3-linked glycans extend linearly from sialic acid, leaving the binding site between the amino terminus of the 190-helix and the carboxyl terminus of the 220-loop (Fig. 2). The orientations of all avian receptors bound to avian HAs are strikingly similar. In contrast, α-2,6-linked derivatives bound to human HAs are folded over the bound sialic acid, to different degrees depending on the particular subtype of HA (Fig. 2).
As noted before, changes in the specificity and affinity of viruses for receptors occur as a consequence of mutations in and around the receptor-binding site that are selected by the pressure of growth requirements in cells with different surface sialosides during transfer of viruses between species or by the immune pressure that occurs as viruses spread during a pandemic. In the former, changes in the 190-helix and the 220-loop (Fig. 1) have been observed in early pandemic viruses compared with their putative avian precursors, and the consequences for binding specificity changes have been reported (Matrosovich et al. 2000; Ha et al. 2003).
Changes that occur during a pandemic as a consequence of antigenic drift, are an indication of the importance of the inhibition of receptor binding by antibodies, as a mechanism of neutralizing virus infectivity.
MEMBRANE FUSION
Priming by Precursor Cleavage
The structure of the translation product of the gene for HA, HA0, differs from that of HA at the site of cleavage of its subunits into HA1 and HA2 (Chen et al. 1998). Cleavage, which is required for infectivity, involves cellular proteases (Böttcher-Friebartshäuser et al. 2013; Limburg et al. 2019) and produces the carboxyl terminus of HA1 and the amino terminus of HA2. The amino-terminal 23-residues of HA2 are known as the fusion peptide (Fig. 1D).
On cleavage, the fusion peptide refolds and inserts into a charged cavity in the center of the HA trimer. Three ionizable residues are buried in the process: in Group 1 HAs, Asp-109, Asp-112, and His-111, and in Group 2 HAs, Asp-109, Asp-112, and His 17. The differences in structure between HA0 and HA as a consequence of this rearrangement are confined to 19 residues out of a total of 549 in each subunit, but they correlate with differences in trimer stability—HA0 Tm, 50°; HA Tm, 62° (Ruigrok et al. 1986; Carr et al. 1997; Skehel et al. 2008). Cleavage primes HA for activation at low pH in endosomes. The conformation of uncleaved H3 HA0 appears unresponsive to changes in pH.
Changes in HA Conformation at Low pH
Activation of HA fusion potential occurs between pH 5.0 and 6.5 depending on the strain of virus. At this low pH, HA refolds extensively and irreversibly. It becomes susceptible to proteolysis, and X-ray crystallographic analysis of proteolytic fragments or equivalent expression products support conclusions from biochemical, antigenic, and electron microscopic studies on the nature of the changes and their consequences (Bullough et al. 1994; Chen et al. 1999). They indicate that the membrane-distal subdomains of HA detrimerize but otherwise retain their neutral pH structure (Bizebard et al. 1995) and that the fusion peptide is released from its buried location to become the amino terminus of a new coiled-coil of α-helices (Fig. 1B; Bullough et al. 1994; Chen et al. 1999). Analysis of the final product of HA2 refolding shows that the fusion peptide and the membrane anchor are colocated at one end of a 110 Å-long, rod-like structure, following a 180° turn between residues HA2 106 and 111 in helix B (Fig. 1). This irreversibly refolded structure is the most stable of the different conformations of HA studied, with a thermal denaturation temperature of 80°. Neutral pH, cleaved HA with a Tm of 62° is, therefore, intermediate in stability between HA0, Tm 50° and fusion pH HA, Tm 80° (Ruigrok et al. 1986; Skehel et al. 2008; Kim et al. 2011).
EM analysis of virus fusion with liposomes (Calder and Rosenthal 2016) indicates that 200 Å-long structures formed at fusion pH by HA, bridge virus, and liposome membranes. They may be intermediates in the formation of the 110 Å rods before the 180° turn at HA2 residues 106–111 is formed. A coiled structure of this sort was initially predicted from sequence analyses (Ward and Dopheide 1980) and subsequently was suggested to be formed at low pH, from studies of synthetic peptides containing the sequence of the interhelical loop (Carr and Kim 1993). Current interest in the process of refolding is focused on the identification and characterization of possible intermediates by biophysical techniques (Garcia et al. 2015; Das et al. 2018) and by EM (Fontana and Steven 2012; Calder and Rosenthal 2016).
The fusion peptide itself has not been observed as a component of fusion pH HA fragments. However, NMR studies of 23-residue fusion peptide analogs (Lorieau et al. 2010, 2012) indicate that, in detergent solution, the peptides form hairpin-like structures of two antiparallel helices. The turn of the hairpin is at glycine-13 with the remaining conserved glycine residues (Skehel and Waterfield 1975; Cross et al. 2009) arranged as components of the inner surfaces of both helices of the hairpin (Fig. 1C). As part of fusion-active HA, the individual fusion peptides of the HA trimer may adopt a similar helical structure.
Considerations of possible requirements for HA refolding in the mechanism of membrane fusion have been based, first, on the observation that both the carboxy-terminal membrane anchor and amino-terminal fusion peptide of HA2 are colocated at one end of the fusion pH HA structure. This has led to the suggestion that HA refolding may be required to bring virus and cellular membranes close enough to each other to facilitate their fusion. Second, formation of the long coiled-coil is proposed to deliver the fusion peptide at its amino terminus to the membrane to be fused, and in its fusion-active form, the fusion peptide may directly alter the structure of the lipid bilayer to facilitate fusion (Chernomordik and Kozlov 2003; Kim et al. 2011).
From analyses of EM and X-ray crystallographic structures it is also possible to postulate that both ends of the structures, the fusion peptide and the membrane anchor subdomain, may be required to be flexibly linked to HA. The available information indicates that this may be so. The membrane anchor at the carboxyl terminus of full-length, neutral pH HA is seen in different images by cryo-EM to be flexibly linked to the ectodomain (Fig. 1; Benton et al. 2018). In the case of the 110 Å rod-like structure of HA2 (Fig. 1), which lacks the membrane anchor and the fusion peptide, the electron density maps are not interpretable between the amino terminus of the expressed construct, HA2-23, and the amino terminus of the observed trimeric coiled-coil, HA2-38 (Chen et al. 1999). This suggests that the region immediately carboxy-terminal to the fusion peptide in intact HA may also be flexible.
Antigenicity
The amino acid sequences of HAs of antigenic variants selected by growth of viruses in the presence of monoclonal antibodies (Gerhard and Webster 1978) have been used to map the molecular locations at which the mutations occur that are responsible for the changes in antigenicity. Most frequently, the mutants contain single amino acid substitutions, sometimes resulting in the introduction of an additional carbohydrate side chain (Caton et al. 1982; Skehel et al. 1984; Knossow and Skehel 2006). EM (Wrigley et al. 1983; Liu et al. 2017; Benton et al. 2018; Turner et al. 2019), and X-ray crystallographic analyses of HA in complexes with Fab fragments from monoclonal antibodies (Bizebard et al. 1995; Knossow and Skehel 2006) indicate that the positions of the substitutions define the sites of antibody binding (Fig. 6). The sites are mainly on the R subdomain (Figs. 1 and 6), which has been interpreted to indicate that the mechanisms by which the antibodies neutralize virus infectivity is either directly or indirectly, by blocking sialic acid–receptor binding (Knossow et al. 2002). The majority of antibodies of this sort that have been reported are strain-specific. However, some of them are able to prevent infection by viruses within a subtype, by binding either directly into the receptor-binding site, or to separate sites on the membrane-distal surface of R (Whittle et al. 2011; Ekiert et al. 2012; Xiong et al. 2015; Wang et al. 2019) or to sites in the interface between monomers in the HA trimer (Bangaru et al. 2019; Watanabe et al. 2019).
In contrast to complexes formed by Fabs of the subtype-specific antibodies, complexes formed by HAs with Fabs from a number of antibodies that block infections by all viruses of Group 1 or Group 2 or of both Group1 and Group 2 (Corti et al. 2017) reveal that they interact with the F subdomain near the fusion peptide (Fig. 6). In vitro these antibodies block the conformational changes in HA required for membrane fusion and can also prevent cleavage of the HA0 precursor. In these ways they may inhibit virus replication. However, the HA-antibody complexes that they form are recognized in vivo by Fc receptor–bearing immune cells that lyse infected cells (Corti et al. 2011; DiLillo et al. 2014), and this may be the mechanism of antiviral activity of this group of cross-reactive antibodies. Information gained from analyses of such antibodies, in particular those that recognize all viruses within a subtype, are valuable for research programs that have the objective of developing immunotherapies (Corti et al. 2017) or immunogens that could induce cross-reactive antibodies by vaccination (Krammer and Palese 2013) or identifying broad spectrum antiviral molecules (van Dongen et al. 2019), all to overcome the current unpredictability of influenza outbreaks and pandemics.
ACKNOWLEDGMENT
This article has been made freely available online courtesy of TAUNS Laboratories.
Footnotes
-
Editors: Gabriele Neumann and Yoshihiro Kawaoka
-
Additional Perspectives on Influenza: The Cutting Edge available at www.perspectivesinmedicine.org