SARS-CoV-2 from alpha to epsilon
As battles to contain the COVID-19 pandemic continue, attention is focused on emerging variants of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus that have been deemed variants of concern because they are resistant to antibodies elicited by infection or vaccination or they increase transmissibility or disease severity. Three papers used functional and structural studies to explore how mutations in the viral spike protein affect its ability to infect host cells and to evade host immunity. Gobeil et al. looked at a variant spike protein involved in transmission between minks and humans, as well as the B1.1.7 (alpha), B.1.351 (beta), and P1 (gamma) spike variants; Cai et al. focused on the alpha and beta variants; and McCallum et al. discuss the properties of the spike protein from the B1.1.427/B.1.429 (epsilon) variant. Together, these papers show a balance among mutations that enhance stability, those that increase binding to the human receptor ACE2, and those that confer resistance to neutralizing antibodies.
Science, abi6226, abi9745, abi7994, this issue p. 641, p. 642, p. 648
Abstract
Several fast-spreading variants of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) have become the dominant circulating strains in the COVID-19 pandemic. We report here cryo–electron microscopy structures of the full-length spike (S) trimers of the B.1.1.7 and B.1.351 variants, as well as their biochemical and antigenic properties. Amino acid substitutions in the B.1.1.7 protein increase both the accessibility of its receptor binding domain and the binding affinity for receptor angiotensin-converting enzyme 2 (ACE2). The enhanced receptor engagement may account for the increased transmissibility. The B.1.351 variant has evolved to reshape antigenic surfaces of the major neutralizing sites on the S protein, making it resistant to some potent neutralizing antibodies. These findings provide structural details on how SARS-CoV-2 has evolved to enhance viral fitness and immune evasion.
The COVID-19 pandemic, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) (1), has led to millions of lives lost and devastating socioeconomic disruptions worldwide. Although the mutation rate of the coronavirus is relatively low because of the proofreading activity of its replication machinery (2), several variants of concern have emerged—including the B.1.1.7 lineage first identified in the United Kingdom, the B.1.351 lineage in South Africa, and the B.1.1.28 lineage in Brazil—within a period of several months (3–5). These variants not only appear to spread more efficiently than the virus from the initial outbreak [i.e., the strain Wuhan-Hu-1; (1)] but also may be more resistant to immunity elicited by the Wuhan-Hu-1 strain after either natural infection or vaccination (6–8). The B.1.1.7 variant is of particular concern because it has been reported to be more deadly (9, 10). Thus, understanding the underlying mechanisms of the increased transmissibility, risk of mortality, and immune resistance of new variants may facilitate development of intervention strategies to control the crisis.
SARS-CoV-2 is an enveloped positive-stranded RNA virus that depends on fusion of viral and target cell membranes to enter a host cell. This first key step of infection is catalyzed by the virus-encoded trimeric spike (S) protein, which is also a major surface antigen and thus an important target for development of diagnostics, vaccines, and therapeutics. The S protein is synthesized as a single-chain precursor and is subsequently cleaved by a furin-like protease into the receptor-binding fragment S1 and the fusion fragment S2 [fig. S1 and (11)]. Binding of the viral receptor angiotensin-converting enzyme 2 (ACE2) on the host cell surface to the receptor-binding domain (RBD) of S1, together with a second proteolytic cleavage by another cellular protease in S2 [S2′ site; fig. S1 and (12)], induces dissociation of S1 and irreversible refolding of S2 into a postfusion structure, ultimately leading to membrane fusion (13, 14). In the prefusion conformation, S1 folds into four domains—NTD (N-terminal domain), RBD, and two CTDs (C-terminal domains)—that wrap around the prefusion S2 structure. The RBD can adopt two distinct conformations: “up” for a receptor-accessible state and “down” for a receptor-inaccessible state (15). Rapid progress in structural biology of the S protein has advanced our knowledge of the SARS-CoV-2 entry process (15–28). We have previously identified two structural elements, the FPPR (fusion peptide proximal region) and the 630 loop, which appear to modulate the S protein stability as well as the RBD conformation and thus the receptor accessibility (22, 28).
The S protein is the basis of almost all the first-generation COVID-19 vaccines, which were developed using the Wuhan-Hu-1 sequence (29, 30). Several have received emergency use authorization by various regulatory agencies throughout the world because of their impressive protective efficacy and minimal side effects (31, 32). These vaccines appear to have somewhat lower efficacy against the B.1.351 variant than against its parental strain (6–8, 33), and this variant became completely resistant to many convalescent serum samples in vitro (8). How to address genetic diversity has therefore become a high priority for developing next-generation vaccines. In this study, we have characterized the full-length S proteins from the B.1.1.7 and B.1.351 variants and determined their structures by cryo–electron microscopy (cryo-EM), providing a structural basis for understanding the molecular mechanisms of the enhanced infectivity of B.1.1.7 and the immune evasion of B.1.351.
Biochemical and antigenic properties of the intact S proteins from the new variants
To characterize the full-length S proteins with the sequences derived from natural isolates of the B.1.1.7 (hCoV-19/England/MILK-C504CD/2020) and B.1.351 (hCoV-19/South Africa/KRISP-EC-MDSH925100/2020) variants (fig. S1), we first transfected human embryonic kidney (HEK) 293 cells with the respective expression constructs and compared their membrane fusion activities with those of the full-length S constructs of their parental strains [Wuhan-Hu-1: D614 (Asp at position 614) and its early D614G variant: G614 (Asp-to-Gly mutation at position 614) (34)]. All S proteins expressed at comparable levels (fig. S2A), and the cells producing these S proteins fused efficiently with ACE2-expressing cells (fig. S2B). Consistent with our previous findings (22, 28), the G614 and B.1.351 variant S constructs showed slightly higher fusion activity than the D614 and B.1.1.7 variants, but the differences diminished when the transfection level increased.
To produce the full-length S proteins, we added a C-terminal strep-tag to the B.1.1.7 and B.1.351 S (fig. S3A) and expressed and purified these proteins under the conditions established for producing the D614 and G614 S trimers (22, 28). The B.1.1.7 protein eluted in three distinct peaks, representing the prefusion S trimer, postfusion S2 trimer, and dissociated S1 monomer, respectively (22), consistent with Coomassie-stained SDS–polyacrylamide gel electrophoresis (SDS-PAGE) analysis (fig. S3B). Nonetheless, the prefusion trimer was the predominant form, accounting for >70% of the total protein, indicating that this trimer is more stable than D614, where the prefusion trimer was only <25%. Like the G614 trimer (28), B.1.351 protein eluted in a single major peak, corresponding to the prefusion S trimer (fig. S3B), with no obvious peaks for dissociated S1 and S2. SDS-PAGE analysis showed that the prefusion trimer peaks contained primarily the cleaved S1/S2 complex for both the proteins, with the cleavage level moderately higher for B.1.351 than for B.1.1.7. These results indicate that the B.1.351 and G614 S proteins have almost identical biochemical properties, whereas the B.1.1.7 trimer is slightly less stable.
To assess antigenic properties of the prefusion variant S trimers, we measured their binding to soluble ACE2 and S-directed monoclonal antibodies isolated from COVID-19 convalescent individuals by bio-layer interferometry (BLI). These antibodies target various epitopic regions on the S trimer, as defined by clusters of competing antibodies and designated RBD-1, RBD-2, RBD-3, NTD-1, NTD-2, and S2 [fig. S4A; (35)]. All but the last two clusters contain neutralizing antibodies. The B.1.1.7 variant bound stronger to the receptor than did its G614 parent, regardless of the ACE2 oligomeric state (Fig. 1, fig. S4B, and table S1). The B.1.351 trimer had higher affinity for monomeric ACE2, but slightly lower affinity for dimeric ACE2, than the G614 trimer. In both cases, affinity for ACE2 of the B.1.351 protein was lower than that of the B.1.1.7 variant. These results suggest that the mutation in the RBD of the B.1.1.7 variant [N501Y (Asn501→Tyr)] enhances receptor recognition, whereas the additional mutations in the B.1.351 variant [K417N (Lys417→Asn) and E484K (Glu484→Lys)] reduce ACE2 affinity to a level close to that of the G614 protein, consistent with the previous data (36, 37). All selected monoclonal antibodies bound G614 S with reasonable affinities, and the B.1.1.7 variant showed a similar pattern but with substantially stronger binding to almost all the antibodies (Fig. 1, fig. S4B, and table S1). By contrast, the B.1.351 variant completely lost binding to the two RBD-2 antibodies, G32B6 and C12A2, as well as to the two NTD-1 antibodies, C12C9 and C83B6, whereas the affinities for the rest of the antibodies were the same as those of the G614 trimer. The BLI data were also consistent with the binding results with the membrane-bound S trimers measured by flow cytometry (fig. S5).
Antigenic properties of the purified full-length SARS-CoV-2 S proteins. BLI analysis of the association of prefusion S trimers from the G614 “parent” strain and the B.1.1.7 and B.1.351 variants derived from it with soluble ACE2 constructs and with a panel of antibodies representing five epitopic regions on the RBD and NTD [see fig. S4A and (35)]. For ACE2 binding, purified S proteins were immobilized to AR2G (Amine Reactive 2nd Generation) biosensors and dipped into the wells containing ACE2 at different concentrations. For antibody binding, various antibodies were immobilized to AHC (Anti-human IgG Fc Capture) biosensors and dipped into the wells containing each purified S protein at different concentrations. Binding kinetics were evaluated using a 1:1 Langmuir model except for dimeric ACE2 and antibody G32B6 targeting the RBD-2, which were analyzed by a bivalent binding model. The sensorgrams are in black and the fits in red. Binding constants are also summarized here and in table S1. All experiments were repeated at least twice with essentially identical results. KD, dissociation constant; n.a., not available; RU, response unit.
We next assessed the neutralization potency of the antibodies and the trimeric ACE2 construct in blocking infection of these variants in an HIV-based pseudovirus assay. For most antibodies, the neutralization potency correlated with their binding affinity for the membrane-bound or purified S proteins (fig. S6 and table S2). C81D6 and C163E6 recognize two non-neutralizing epitopes, located in the NTD and S2, respectively, and they did not neutralize any of the pseudoviruses. The B.1.1.7 virus is the most sensitive to the trimeric ACE2 and the RBD-up–targeting C63C7, suggesting that the B.1.1.7 trimer may prefer the RBD-up conformation. Thus, the detergent-solubilized S proteins adopt a physiologically relevant conformation, and mutations in B.1.351 have a greater impact on the antibody sensitivity of the virus than those in B.1.1.7.
Structures of the full-length S trimers from the B.1.1.7 and B.1.351 variants
We determined the cryo-EM structures of the full-length S trimers with the unmodified sequences of the B.1.1.7 and B.1.351 variants. Cryo-EM images were acquired on a Titan Krios electron microscope equipped with a Gatan K3 direct electron detector. We used RELION (38) for particle picking, two-dimensional (2D) classification, 3D classification, and refinement (figs. S7 to S10), and cryoSPARC (39) for validation. 3D classification identified five distinct classes for the B.1.1.7 S trimer—representing one closed prefusion conformation, three one-RBD-up conformations, and one two-RBD-up conformation—and two different classes for the B.1.351 trimer, representing a closed conformation and a one-RBD-up conformation. These structures were refined to 2.9- to 4.3-Å resolution (figs. S7 to S10 and table S3).
The overall architectures of the full-length variant S proteins are very similar to that of the G614 S trimer in the corresponding conformation [figs. S11 and S12; (28)]. In the closed, three-RBD-down structure, the four domains of S1—NTD, RBD, CTD1, and CTD2—wrap around the prefusion S2 trimer. In the one-RBD-up conformation, the RBD position has no effect on the central core region of S2, but two NTDs, the immediately adjacent one and the one from the same protomer, shift away from the threefold axis and open up the trimer. The furin cleavage site at the S1/S2 boundary (residues 682 to 685) in these structures remains disordered, and the structures therefore cannot explain the difference in the cleavage level between the B.1.1.7 and B.1.351 trimers; the position of a substitution [P681H (Pro681→His)] in the B.1.1.7 S (fig. S1) close to the cleavage site is likewise not visible. A small class of particles in the two-RBD-up conformation was present only with the B.1.1.7 trimer (fig. S11), possibly because B.1.1.7 S1 is less likely to dissociate.
For the B.1.1.7 S trimer, most particles used for refinement were in the RBD-up conformation (Fig. 2, A to E). We have proposed that the FPPR (residues 828 to 853) and 630 loop (residues 620 to 640) modulate the stability and fusogenic structural rearrangements of the S protein (22, 28). In the closed conformation of the B.1.1.7 trimer, all three FPPR and three 630 loops are disordered (Fig. 2F), which otherwise would help clamp down the RBDs. This explains why the B.1.1.7 trimer is more likely than its parental G614 variant to populate the RBD-up conformation, because the FPPRs and 630 loops are structured in the G614 trimer (28). In the one-RBD-up conformation, one 630 loop on the opposite side of the up RBD becomes fully structured, inserting between neighboring NTD and CTDs in the same configuration found in the G614 trimer (28). The second 630 loop is partially ordered, whereas the third one remains disordered. A similar pattern is found for three FPPRs, although the structured FPPR adopts a conformation distinct from the one seen in our previous structures of the full-length S proteins (22, 28). Overall, the arrangement of these structural elements appears to stabilize the cleaved S trimer and to prevent the premature S1 dissociation in the one-RBD-up conformation. The three one-RBD-up structures differ only by the degree to which the up RBD and the adjacent NTD of its neighboring protomer shift away from the central threefold axis (fig. S13A). We have suggested that the two-RBD-up conformation might be unstable (22, 28), leading to S1 dissociation and irreversible S2 refolding. If this suggestion is valid, the small class of the two-RBD-up particles probably contains mainly uncleaved S trimers.
(A to E) The structures of the closed prefusion conformation (A), three one-RBD-up conformations [(B) to (D)], and a two-RBD-up conformation (E) of the B.1.1.7 S trimer are shown in ribbon diagram, with one protomer colored as NTD in blue, RBD in cyan, CTD1 in green, CTD2 in light green, S2 in light blue, the 630 loop in red, and the FPPR in magenta. (G and H) The structures of the closed prefusion conformation (G) and one-RBD-up conformation (H) of the B.1.351 S trimer are shown in ribbon diagram with the same color scheme as in (A) to (E). All mutations in the new variants, as compared with the original virus (D614), are highlighted as sphere models. (F and I) Structures, in the B.1.1.7 trimer, of segments (residues 617 to 644) containing the 630 loop (red) and segments (residues 823 to 862) containing the FPPR (magenta) from each of the three protomers (a, b, and c). The position of each RBD is indicated. Dashed lines indicate gaps in the chain trace (disordered loops).
The two classes for the B.1.351 S trimer represent the closed prefusion and one-RBD-up states, respectively (Fig. 2, G and H). The configurations of the FPPR and 630 loop closely follow the distribution seen in the G614 trimer: All are structured in the RBD-down conformation, whereas only the FPPR and 630-loop pair is ordered in the one-RBD-up conformation [fig. S12; (28)]. These observations are consistent with the similar biochemical stabilities of the B.1.351 and G614 S trimers [fig. S3; (28)].