Abstract
Free full text
PNAS Plus
Effects of protein size, thermodynamic stability, and net charge on cotranslational folding on the ribosome
Significance
There is increasing interest to understand how proteins fold as they are synthesized on the ribosome. However, we still lack basic knowledge such as how protein size, net charge, and thermodynamic stability impact cotranslational folding. Here, we have studied eight proteins of increasing size and provide a comprehensive picture of how the location in the ribosome exit tunnel where a protein folds correlates with protein size. Moreover, we demonstrate that the force exerted on the nascent chain by protein folding varies linearly with the thermodynamic stability of the folded state, and that the ribosome environment disfavors folding of domains of high net-negative charge. These findings establish important basic facts about cotranslational folding.
Abstract
During the last five decades, studies of protein folding in dilute buffer solutions have produced a rich picture of this complex process. In the cell, however, proteins can start to fold while still attached to the ribosome (cotranslational folding) and it is not yet clear how the ribosome affects the folding of protein domains of different sizes, thermodynamic stabilities, and net charges. Here, by using arrest peptides as force sensors and on-ribosome pulse proteolysis, we provide a comprehensive picture of how the distance from the peptidyl transferase center in the ribosome at which proteins fold correlates with protein size. Moreover, an analysis of a large collection of mutants of the Escherichia coli ribosomal protein S6 shows that the force exerted on the nascent chain by protein folding varies linearly with the thermodynamic stability of the folded state, and that the ribosome environment disfavors folding of domains of high net-negative charge.
More than 40 y ago Anfinsen postulated that “the native structure is determined only by the protein’s amino acid sequence” (1). His work established the protein folding field, which up until recently has focused almost exclusively on biophysical studies of purified proteins that can be reversibly unfolded/refolded in vitro. However, it is becoming increasingly clear that many proteins start to fold cotranslationally, that is, while still being synthesized on the ribosome (2–6); elements of secondary structure or small protein domains may even fold completely inside the ribosome exit tunnel (7–13). In contrast to in vitro folding, cotranslational folding is still a poorly understood process (14), and we lack basic information such as how protein size and net charge relate to where in the exit tunnel a protein starts to fold, and how protein stability impacts cotranslational folding.
Here, using arrest peptides (APs) as force sensors (15, 16) and on-ribosome pulse proteolysis (17), we have analyzed the cotranslational folding of eight protein domains that display cooperative folding in vitro. The domains are of different size and fold type, of different thermodynamic stability, and of different net charge. We find direct correlations between protein size and the location in the ribosome exit tunnel at which a protein folds, and between thermodynamic stability and the pulling force generated on the nascent chain during folding. Further, it appears that nascent chain segments with high net-negative charge are pushed out of the negatively charged ribosome exit tunnel before they fold. These findings establish important basic facts about cotranslational folding and reinforce the view of the exit tunnel as an environment that can have a strong impact on protein folding.
Results
Folding Assays.
We have used two assays to follow cotranslational folding of protein domains: an AP-based assay that makes it possible to detect the tension generated in the nascent chain when a domain folds (8) and an on-ribosome pulse-proteolysis assay where thermolysin resistance is used as an indicator of folding (17).
Translational APs are short stretches of polypeptide that interact with the upper parts of the ribosome exit tunnel in such a way that they stall translation when the ribosome reaches the last codon in the AP (18). APs are sensitive to external forces pulling on the nascent chain (19), and stalling efficiency is reduced in proportion to the external pulling force (20, 21). Hence, APs can be used as force sensors to report on cotranslational processes such as protein translocation (22), membrane protein biogenesis (21, 23), and protein folding (8, 15, 20, 24).
The basic construct used in the AP cotranslational folding assay is composed of the following elements (Fig. 1A): (i) the protein domain to be studied, (ii) a variable-length stretch of alternating glycine and serine (GS) residues, (iii) the AP, and (iv) a 23-residue C-terminal extension that makes it possible to separate arrested and full-length protein products by SDS/PAGE. We define linker length L as the number of residues between the protein domain and the end of the AP. As illustrated in Fig. 1B, for small L (short linkers) there is not enough space in the ribosome exit tunnel to allow folding at the point during translation when the ribosome reaches the end of the AP, and little force is exerted on the AP. At some intermediate value of L, the chain is just long enough to allow folding if stretched, and the force on the AP is increased. At large L, the domain is already folded when the ribosome reaches the codon at end of the AP, and again little force is generated. Since APs stall translation less efficiently when under tension (18, 20, 21), the fraction full-length protein (fFL) produced for constructs with different L reflects the variation in force generated on the AP by the folding reaction, and a plot of fFL vs. L can be used to infer where in the exit tunnel folding occurs during translation (15). Mutagenesis studies (16, 25), as well as visualization of folded protein domains located within the exit tunnel by cryo-EM (8, 15, 16, 24) and molecular dynamics simulations (8, 24), show that the dominant peak in a fFL profile corresponds to folding into the native state (as opposed to, e.g., nonspecific compaction of the nascent chain), at least for small, single-domain proteins; further support for this notion is provided below. fFL profiles were recorded for each protein by in vitro translation in the PURE (Protein synthesis Using Recombinant Elements) system (26), separation of arrested (A) and full-length (FL) forms of the protein by SDS/PAGE, and quantitation of the A and FL bands by phosphoimager analysis.
The pulse-proteolysis assay (17) is based on the premise that a short exposure of a stalled ribosome-nascent chain complex to thermolysin will lead to degradation of the protein if it is partly or wholly unfolded, but not if it is folded (Fig. 1C). Since thermolysin cleaves only on the N-terminal side of hydrophobic residues (27), the (GS)n linker will not be digested. Like the AP assay, pulse proteolysis reports on folding into the native state rather than nonspecific compaction of the nascent chain (17, 28). Examples of SDS/PAGE gels for both assays are shown in SI Appendix, Fig. S1.
The Onset of Cotranslational Folding Correlates with Domain Size.
To put the possible relation between protein size and the location in the ribosome exit tunnel where folding commences (4, 7) on a firm quantitative footing, we selected eight protein domains of different folds and with sizes ranging from 28 to 128 residues for study (SI Appendix, Tables S1 and S2): the de novo-designed protein full sequence design 1 (FSD-1) (28 residues) (29), a WW domain (35 residues) (30), the de novo-designed protein EHEE_rd2_0005 (40 residues) (31), protein G (56 residues) (32), a calmodulin fragment (69 residues) (33), ribosomal protein S6 (101 residues) (34), superoxide dismutase 1 (SOD1, 110 residues) (35), and ileal binding protein (ILBP, 128 residues) (36). We used the moderately strong 17-residue Escherichia coli SecM(Ec) AP for all eight proteins (37); in addition we also used the stronger eight-residue SecM(Ms) AP from Mannheimia succiniciproducens (38) for protein S6. All fFL profiles are shown in Fig. 2.
The fFL profile for FSD1 (a designed 28-residue αββ zinc-finger domain that does not require Zn2+ to fold) has a maximum already at Lmax = 17 residues (Fig. 2A), significantly different from the fFL profile previously obtained for the 29-residue zinc-finger protein ADR1a that has Lmax ≈ 25 residues (8). It has been suggested that the α-helix in FSD1 is more rigid than the β-hairpin, and there is low cooperativity between the α-helix and the β-hairpin during folding (39). Circular dichroism and thermal denaturation suggest that the melting temperature of FSD1 mainly reflects the melting of the α-helix (40). As stretches of α-helix can form deep in the exit tunnel (13, 41, 42), it seems likely that the high fFL value at L = 17 residues reflects the formation of the α-helix rather than folding of the whole domain. To confirm that a folding event is responsible for the peak we introduced a point mutation (F21P) that breaks the α-helix in the L = 17 residues construct; this caused fFL to drop (from 0.77 to 0.47, red data point), as expected.
The WW domain is one of the smallest known β-sheet proteins that can fold autonomously (43). The fFL profile for the 35-residue WW domain (an antiparallel three-stranded β-sheet) peaks at Lmax = 23–25 residues and the protein starts to fold at Lonset ≈ 22 residues (Fig. 2B), in line with previous findings (7). Introduction of the mutation Y19A that reduces the thermodynamic stability of the WW domain (30) caused a reduction of fFL at L = 25 residues from 0.60 to 0.39 (red data point), as expected if the peak reflects a folding event.
The 40-residue protein EHEE_rd2_0005 (βαββ fold) is an artificial miniprotein produced by two rounds of design and selection, and claimed to be “the most stable minimal protein ever found (lacking disulfides or metal coordination)” (31). The peak of the force profile has the highest amplitude of all of the tested proteins, with fFL = 1 between L = 28–40 residues (Fig. 2C). Folding starts at Lonset ≈ 24 residues.
The fFL profile for the 56-residue protein G (ββαββ fold) (44) has a pronounced peak that starts at Lonset ≈ 28 residues and reaches a maximum at Lmax ≈ 35 residues (Fig. 2D). To confirm that the peak is due to folding we introduced the destabilizing mutation F52L (45) that decreased fFL from 0.7 to 0.3 at L = 30 residues (red data point).
Wild-type calmodulin contains four Ca2+-binding sites in the so-called EF-hands EF-1 to EF-4. Each EF-hand folds into a helix–loop–helix motif; the 69-residue version of the protein tested here contains only the EF-2 and EF-3 domains (33). This particular version of calmodulin requires Ca2+ to fold and has a well-defined hydrophobic core composed mainly of aromatic residues (33). When translated in the presence of 3 mM calcium chloride, calmodulin EF-2–EF-3 displays a peak in the fFL profile with Lonset ≈ 34 residues and at Lmax ≈ 40 residues (Fig. 2E). No comparable peak is seen when the translation reaction is supplemented with the Ca2+ chelator EGTA (0.3 mM), validating folding as the source of the peak.
Ribosomal protein S6 from Thermus thermophilus is a 101-residue βαββαβ protein with a ferredoxin-like fold (46). It has been subjected to multiple in vitro folding studies (47–52). The fFL profile of S6 has a peak with Lonset ≈ 27 residues and Lmax ≈ 28 residues (Fig. 2F). We also obtained an fFL profile with the stronger SecM(Ms) AP and determined the fFL profile at single-residue resolution around Lmax. With the stronger AP, the peak in the fFL profile is better defined, with Lonset = 28 residues and Lmax = 29 residues, and returns to a low baseline of fFL ≈ 0.3 at long linker lengths.
SOD1 plays an important role in neurodegenerative disease (53), and its misfolding and aggregation due to demetalation has been linked to amyotrophic lateral sclerosis. The version of SOD1 tested here has been through a process of loop removal, leaving a β-barrel that does not bind copper or zinc but still folds cooperatively into a globular domain (35). The SOD1 fFL profile has a major peak at Lmax ≈ 45 residues (Fig. 2G). We further studied the folding of SOD1 using pulse proteolysis (17). The protein acquires full thermolysin resistance (as measured by the fraction protease-resistant arrested form of the protein, fTR) at L ≈ 40 residues, coincident with the onset of folding at Lonset ≈ 38 residues seen in the fFL profile.
The ILBP is the biggest protein analyzed in this study (128 residues). It shows a significant increase in fraction full length at Lonset ≈ 38 residues and a maximum at Lmax = 45–50 residues (Fig. 2H). As was observed with SOD1, thermolysin resistance reaches its maximum coincident with the onset of folding seen in the fFL profile. We also tested a variant of ILBP that is unable to fold without ligand (36), in which the two helices that cap the β-barrel were removed and replaced by a small GSGS linker (depicted in SI Appendix, Fig. S1A). In the absence of ligand, the fFL profile of this variant remains constant throughout the range L = 20–60 residues (SI Appendix, Fig. S1D, red curve). In contrast, upon ligand binding the mutant protein displays an fFL profile similar to that of the wild-type protein, but with somewhat lower amplitude (SI Appendix, Fig. S1D, gray curve). Notably, in the presence of ligand the mutant protein has a lower melting temperature (Tm = 41.2 °C) than the apo wild-type ILBP (Tm = 60.5 °C) (54), supporting the idea that the amplitude of the fFL peak correlates with the thermodynamic stability of the protein.
One notable feature of all of the fFL profiles obtained with the SecM(Ec) AP in Fig. 2 is the rather high fFL values for the longest linker lengths compared with previously reported fFL profiles (8, 15, 24). The most obvious difference between the constructs analyzed here and those analyzed before is that we here use a flexible (GS)n linker (to ensure resistance to thermolysin cleavage) rather than a linker with more variable amino acid composition derived from the E. coli LepB protein. We therefore substituted the (GS)n linker with the previously used LepB-derived linker in the FSD-1 [L = 61] construct. Indeed, fFL was reduced to 0.03, much lower than the value of ~0.5 seen for large L values with the (GS)n linker (Fig. 2A, blue arrow). The relatively high limiting fFL values at large L values are thus caused by the linker. As seen in the fFL profile for calmodulin EF-2–EF-3 obtained in the absence of Ca2+ (Fig. 2E, red curve), the rise in fFL values caused by the linker becomes noticeable at L ≈ 40 residues [when the (GS)n segment is ~25 residues long] and hence does not influence the calculated Lonset values (that are all ≤40 residues). An increase in fFL values at a similar linker length is evident also in Fig. 2 A and B.
In summary, we find strong correlations between protein size and the distance from the peptidyl transferase center at which the protein folds and between the onset of folding as determined from fFL and fTR profiles.
fFL Correlates with Thermodynamic Stability.
If fFL profiles indeed reflect cotranslational folding to the native state, their amplitude at L = Lmax should reflect the thermodynamic stability of the folded state. The S6 protein provides a good model system to perform this consistency check, since both thermodynamic stability (ΔGD-N) and folding/unfolding rate constants (kf and ku, respectively) have been measured for a large number of point mutants (51). We therefore measured fFL at linker length L = 30 residues, that is, at the middle of the peak of the fFL profile for wild-type S6 obtained with the strong SecM(Ms) AP, for the 16 S6 mutants listed in SI Appendix, Table S3. As seen in Fig. 3, there is a good correlation between fFL and ΔGD-N. A multiple regression analysis including both ΔGD-N and log kf values did not yield a statistically significant improvement in the correlation (P = 0.063, H0 accepted at the 5% level). Folding kinetics thus does not appear to play a significant role in determining fFL values under our experimental conditions. However, since the translation rate is ~10-fold slower in PURE than in vivo (55), further studies will be required to fully address this last point.
The observations that the appearance of the main peak in the S6 (discussed below), SOD1, and ILBP fFL profiles correlates with the acquisition of thermolysin resistance in the on-ribosome pulse-proteolysis assay, and that the amplitude of the main peak in the S6 fFL profile correlates with the thermodynamic stability of the protein, reinforce the notion that the main peak in an fFL profile represents a bona fide folding reaction rather than, for example, the formation of a nonspecific collapsed state. The sharp onset of the main peak observed for most proteins analyzed in Fig. 2 and in earlier studies (8, 24, 25) provides further support that fFL profiles reflect cooperative folding to the native state.
Net Charge and Cotranslational Folding.
Finally, to determine how the net charge of a folded protein domain affects cotranslational folding on the ribosome, we compared the fFL profiles for wild-type S6 and four mutants of different net charge, three of which have previously been characterized in vitro in terms of ΔGD-N, log kf, and log ku (52) (Fig. 4 and SI Appendix, Table S4). Wild-type S6 contains 16 positively charged and 16 negatively charged residues (zero net charge). The S60 variant also has zero net charge but lacks all charged residues. The fFL profile of S60 is broadly similar to that of wild-type S6, but the peak is shifted to somewhat larger L values (Fig. 4C). The fFL profile of the positively charged variant S6+16–9 is also similar to the profile for wild-type S6 (Fig. 4D), except that the peak is sharper. The in vitro folding studies (52) have shown that a mutant that lacks all negatively charged residues (S6+17) has slightly accelerated folding and unfolding rates and somewhat lower thermodynamic stability compared with wild-type S6 (SI Appendix, Table S4); presumably, the corresponding ΔGD-N, log kf, and log ku values for S6+16–9 are even closer to the wild-type S6 values, explaining the similarity between the two fFL profiles. S60 has a higher unfolding rate than S6+17 and a correspondingly lower thermodynamic stability, consistent with the somewhat higher Lonset and lower amplitude of the fFL peak.
In contrast, removal of half of the positive charges from wild-type S6 (variant S6+9–16) causes the peak in the fFL profile to shift from Lmax ≈ 28 to Lmax ≈ 33 residues (Fig. 4E) and leads to a general increase in fFL throughout the profile. Removal of the remaining positively charged residues yields the supercharged but not very stable variant S6−16; in this case, fFL ≥ 0.7 for all L values, and the profile has a broad maximum around Lmax ≈ 50 residues (Fig. 4F).
We also recorded the fFL and fTR profiles for wild-type S6 and the supercharged S6−16 variant using the stronger SecM(Ms) AP (38). The fTR profile for wild-type S6 mirrors the fFL profile, starting to increase at L ≈ 27 residues and plateauing at Lmax ≈ 32 residues (Fig. 4G). In contrast, the S6−16 fTR profile starts out moderately high at small L values, drops to a lower value at L = 30–45 residues, and finally increases again to reach a moderately high plateau at L > 55 residues (Fig. 5H); the fFL profile has no distinct peak, similar to the profile obtained with the SecM(Ec) AP but at lower fFL values (compare Fig. 5F).
The relatively high thermolysin resistance of S6−16 at small L values is surprising. To analyze this further, we introduced the destabilizing mutation L10A in both wild-type S6 and S6−16 at linker lengths L = 17 and L = 57 residues. This mutation reduces ΔGD-N for wild-type S6 from 8.97 to 4.62 kcal/mol (51). The destabilization of S6−16 caused by the mutation has not been measured, but the folding free energy of S6−16 itself is only 2.89 kcal/mol (52). As seen in Fig. 4 G and H (red triangles), fTR of wild-type S6 is only marginally affected by the mutation—as expected, since even the mutated protein is relatively stable—but is strongly reduced for S6−16 at both linker lengths.
We conclude that S6 versions that contain many positively charged residues (wild-type and S6+17–9) start to fold well inside the exit tunnel, while S60 and S6+9–16 fold near the distal end of the tunnel. For the negatively charged versions S6+9–16 and S6−16 fFL values are high for all linker lengths despite their low thermodynamic stability, likely reflecting a strong electrostatic repulsion between the negatively charged exit tunnel and the nascent chain (56–58). There is a hint from the fFL and fTR profiles that the supercharged S6−16 version may fold outside the exit tunnel, around L ≈ 50 residues, but the data are not conclusive on this point.
Discussion
In this study, we have focused on understanding how three central protein characteristics—size, thermodynamic stability, and net charge—affect cotranslational protein folding.
Our findings on the relation between domain size and the location in the ribosome exit tunnel where folding commences are summarized in Fig. 5, which includes data not only from the present study but also from previous studies on ADR1a (8), three spectrin domains (15), a penta-repeat β-helix protein (25), DHFR (59), the I27 (24) and FNL5 (60) Ig-domain proteins, and the designed protein Top7 (20) (see SI Appendix, Table S5 for details). From cryo-EM structures of ADR1a (8), spectrin R16 (15), and titin I27 (24) ribosome-nascent chain complexes we know that a ~35-residue linker is required to span the distance between the P-site and a location at the distal end of the exit tunnel, near a prominent loop in the ribosomal protein uL24 (gray shaded area in Fig. 5).
Small domains (≤70 residues in size) invariably start to fold while still located inside the tunnel (Lonset < 35 residues), and the smaller they are the deeper in the tunnel they fold. Large domains tend to fold at the distal end of, or even outside, the tunnel (Lonset > 35 residues), but, depending on the location of the first accessible folding nucleus in the structure, may also start to fold while still inside the tunnel, as exemplified by the spectrin domains (15). A linear fit of log10(Lonset) vs. log10(size) (SI Appendix, Fig. S3) shows that Lonset scales approximately with the square root of the protein size over the range covered here (dotted line in Fig. 5). Fig. 5 provides the most complete picture to date of how the gradual increase in tunnel dimensions from the so-called constriction site (~25 Å from the peptidyl transferase site) to the tunnel exit allows protein domains of increasing size to fold.
For the largest proteins analyzed, SOD1 and ILBP, in addition to the main folding peak at L ≈ 45–50 residues, we see an early increase in fFL at L ≈ 25–30 residues that might be indicative of a folding intermediate (Fig. 2 G and H). A similar increase in fFL at small L values was seen previously for the 189-residue protein DHFR (59). In vitro, SOD1 does not seem to fold via an intermediate state but displays two-state behavior (35). Further studies will be necessary to characterize these putative folding intermediates.
Cotranslational folding of the exceptionally stable 40-residue protein EHEE_rd2_0005 generates sufficient force on the AP to yield fFL = 1.0 over the whole range L = 28–40 residues (Fig. 2C), setting it apart from the other proteins studied here. This and previous observations (16, 21) suggest that the amplitude of the peak in the fFL profile reflects the thermodynamic stability of the folded protein. Indeed, there is a good correlation between fFL and ΔGD-N for a large set of S6 mutants (Fig. 3). Since different proteins fold in different locations inside or outside the exit tunnel, and presumably push against the ribosome in different ways during folding, we do not expect the precise quantitative relation between fFL and ΔGD-N found for S6 (discussed above) to hold for other proteins, but only that the general form of the correlation applies.
Our results for protein S6 variants of different charge content show that protein net charge can have a dramatic effect on the fFL profile (Fig. 4). In general, the more positive the net charge, the deeper in the exit tunnel S6 folds. We further note that the fFL profile for the fully uncharged S60 variant is similar to that of wild-type S6, demonstrating that a protein that is totally devoid of charged residues can fold in the exit tunnel. This extends previous results demonstrating efficient folding in vitro of a protein completely devoid of charged residues (52, 61) to cotranslational folding. Strikingly, for the negatively charged variants S6+9–16 and S6−16, high values of fFL persist to much larger L values than for the neutral and positively charged variants, the peaks in the fFL profiles are broad and indistinct, and the maximum amplitudes are high, in contrast to what one would expect from the significantly lower thermodynamic stability of S6−16 (SI Appendix, Table S4). Given the net-negative charge of the exit tunnel (56, 58, 62), we hypothesize that the high fFL values seen for S6+9–16 and S6−16 are caused by negatively charged residues in the nascent chain being pushed out of the exit tunnel by electrostatic forces even before the protein folds, consistent with ribosome-profiling data showing that stretches of nascent chain of high net-negative charge show reduced ribosome densities (63).
More generally, the fact that domains of up to ~70 residues in size can fold inside the exit tunnel and therefore should be less likely to aggregate or require chaperones for folding might favor the evolution of proteins composed of small (sub)domains that can initiate early folding inside the ribosome. In support of such a scenario, statistical analyses have identified evolutionarily conserved clusters of rare codons located some 20–50 codons downstream of independently folding subdomains that may facilitate their cotranslational folding by slowing down translation (64, 65).
Finally, the observation that cotranslational folding generates force on the nascent chain raises the possibility that new sensor systems composed of a ligand binding domain coupled to an AP may be found in nature, or be constructed for biosensing purposes. In principle, the calmodulin EF-2–EF-3 and ILBP constructs analyzed here could be used as Ca2+ and glycochenodeoxycholic acid sensors, respectively, if fused to, for example, GFP (7).
Materials and Methods
Enzymes and Chemicals.
Enzymes were purchased from Thermo Scientific and New England Biolabs. Thermolysin from Geobacillus stearothermophilus was purchased from Sigma-Aldrich. Oligonucleotides were purchased from Eurofins MWG Operon. DNA/RNA purification kits were obtained from Qiagen. The New England Biolabs PURExpress In Vitro Protein Synthesis Kit was purchased from BioNordika. [35S]methionine was purchased from PerkinElmer. All other reagents were from Sigma-Aldrich.
DNA Manipulations.
The genes of the FSD-1, Pin1 WW domain (Homo sapiens), and EHEE design were assembled from oligonucleotides (66). The gene for protein G (Streptococcus sp.) was purchased from Genewiz. The wild-type calmodulin gene (H. sapiens) was kindly donated by Sara Linse, Lund University, Lund, Sweden and modified by an adaptation of the original QuikChange site-directed mutagenesis protocol (67) to obtain the fragment tested in this study. SOD1 (H. sapiens) gene (35) was kindly donated by Mikael Oliveberg, Stockholm University, Stockholm. The gene for the ILBP from rabbit (Oryctolagus cuniculus) was purchased from Invitrogen and was truncated by an adaptation of the original QuikChange site-directed mutagenesis protocol (67).
The gene for ribosomal protein S6 from T. thermophilus, along with the genes for the five of its mutants (52), were provided by Mikael Oliveberg.
The constructs were cloned in a pET19b vector under the control of a T7 promoter. As shown in SI Appendix, Table S2, the small domain (FSD-1, Pin1 WW domain, and EHEE) constructs contained a sequence corresponding to an N-terminal unstructured fragment of 158 residues from E. coli LepB (21) (to facilitate visualization by SDS/PAGE), a sequence corresponding to a (GS)n flexible linker of 43 or 52 residues (to reach a 60-residues-long tether length together with the AP), the sequence corresponding to the SecM AP from E. coli (FSTPVWISQAQGIRAGP) (68) [referred to here as SecM(Ec)] or M. succiniciproducens (HAPIRGSP) (69) [referred to here as SecM(Ms)] and a sequence corresponding to a 23-residues-long LepB-derived C-terminal fragment (to distinguish between full-length and arrested version of proteins after SDS/PAGE). The genes were inserted into the expression vector by Gibson assembly (70). Constructs with increasingly shorter tether length starting from the N-terminal part of the flexible linker and control constructs for the full-length [SecM(Ec) FSTPVWISQAQGIRAGA, SecM(Ms) HAPIRGSA] and arrested [SecM(Ec): FSTPVWISQAQGIRAG*, SecM(Ms): HAPIRGS*, where * is a stop codon] forms of the proteins were generated by an adaptation of the original QuikChange site-directed mutagenesis protocol (67). PCR amplified products were subjected to DpnI digestion and purified with GeneJET PCR purification Kit or GeneJET Gel Extraction Kit before Gibson assembly. E. coli DH5α competent cells were transformed and plated onto LB agar plate supplemented with carbenicillin.
Single colonies were picked to inoculate LB supplemented with ampicillin and were incubated for 12–18 h at 37 °C with shaking. GeneJET Plasmid Miniprep Kit was used for plasmid isolation and purification. All constructs were verified by sequencing (Eurofins Genomics).
In Vitro Transcription and Translation.
Linear constructs were obtained by PCR amplification using oligonucleotides overlapping the T7 promoter and terminator on purified plasmids. After DpnI digestion and purification (GeneJET PCR Purification Kit; Thermo Scientific), the linear constructs were used as template for protein expression. The NEB PURExpress In Vitro Protein Synthesis Kit was used for in vitro transcription and translation. The reaction mix was supplemented with [32S]methionine and the synthesis of the radiolabeled proteins was performed at 37 °C, for 15 min, under constant shaking at 700 rpm in an Eppendorf 5417R thermomixer. The reaction was stopped by addition of 1:1 volume of 10% ice-cold TCA and the samples were incubated on ice for at least 30 min. After centrifugation at 20817 × g for 5 min at 4 °C, the supernatant was discarded and the pellet was resuspended in sample buffer at 37 °C for 15 min, under constant shaking at 900 rpm. The samples were treated with RNase I (670 μg/mL) at 37 °C for 30 min and were subsequently resolved by SDS/PAGE. All expressions were performed in triplicates.
On-Ribosome Pulse Proteolysis.
After synthesis of radiolabeled proteins as described in the previous section, the reactions were stopped by the addition of chloramphenicol (instead of TCA) at a final concentration of 3.3 mM. Each sample was divided into two aliquots of equal volume, one for pulse proteolysis and the other serving as a nontreated control. Pulse proteolysis was performed by treatment with thermolysin (buffered in 50 mM Tris and 500 μM ZnCl2, pH 7.0) at a final concentration of 0.75 mg/m for 1 min at 37 °C, under constant shaking at 700 rpm. The same conditions were applied for the nontreated controls with the difference that thermolysin was not included in the buffer solution. The reaction was stopped by addition of 3 μL 500 mM EDTA (pH 8.4), and protein precipitation was performed by addition of 1:1 volume of ice-cold 10% TCA and incubation on ice for 30 min. As described in the previous section, after centrifugation at 20817 × g for 5 min, at 4 °C, the supernatant was discarded and the pellet was resuspended in sample buffer at 37 °C for 15 min, under constant shaking at 900 rpm. The samples were supplemented with RNase I (670 μg/mL) at 37 °C, for 30 min, and were subsequently resolved by SDS/PAGE. All pulse-proteolysis assays were performed in triplicate.
Gel Electrophoresis and Quantitation.
Proteins were resolved by SDS/PAGE and visualized on a Fuji FLA-3000 phosphoimager. For fFL measurements, the bands were quantified to estimate the fraction full-length protein fFL = IFL/(IFL + IA), where IFL is the intensity of the band corresponding to the full-length protein and IA is the intensity of the band corresponding to the arrested form of the protein, fFL.
For pulse-proteolysis measurements the thermolysin-resistant fraction was calculated as fTR = IATR/IABuff, where IATR is the intensity of the arrested band after pulse proteolysis and IABuff is the intensity of the nontreated control band. Bands were quantitated using ImageJ (https://imagej.nih.gov/ij) to obtain an intensity cross-section, which was subsequently fit to a Gaussian distribution using in-house software.
Homology Modeling of ILBP Mutant.
The structure [Protein Data Bank (PDB) ID code 2lba] of chicken ILBP protein in complex with glycochenodeoxycholic acid (GDCG) was used to produce a homology model (71) of the truncated mutant (ILBP-tm) protein used in this study (59.6% Seq. Id with 2lba PDB template). We show the estimated position of the ligand (GDCG) in stick representation in SI Appendix, Fig. 1A.
Acknowledgments
We thank Mikael Oliveberg, Jens Danielsson, Setareh Arya, and Domhnall Iain Henderson for discussions. This work was supported by grants from the Knut and Alice Wallenberg Foundation, the Swedish Cancer Foundation, and the Swedish Research Council (to G.v.H.).
Footnotes
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/lookup/suppl/10.1073/pnas.1812756115/-/DCSupplemental.
References
Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences
Full text links
Read article at publisher's site: https://doi.org/10.1073/pnas.1812756115
Read article for free, from open access legal sources, via Unpaywall: https://www.pnas.org/content/pnas/115/40/E9280.full.pdf
Citations & impact
Impact metrics
Article citations
Resolving chaperone-assisted protein folding on the ribosome at the peptide level.
Nat Struct Mol Biol, 10 Jul 2024
Cited by: 2 articles | PMID: 38987455
Heat-induced structural and chemical changes to a computationally designed miniprotein.
Protein Sci, 33(6):e4991, 01 Jun 2024
Cited by: 1 article | PMID: 38757381
Systematic Investigation of the Trafficking of Glycoproteins on the Cell Surface.
Mol Cell Proteomics, 23(5):100761, 07 Apr 2024
Cited by: 0 articles | PMID: 38593903 | PMCID: PMC11087972
Unveiling the Potential of B3O3 Nanoflake as Effective Transporter for the Antiviral Drug Favipiravir: Density Functional Theory Analysis.
Molecules, 28(24):8092, 14 Dec 2023
Cited by: 0 articles | PMID: 38138581 | PMCID: PMC10746011
Is Posttranslational Folding More Efficient Than Refolding from a Denatured State: A Computational Study.
J Phys Chem B, 127(21):4761-4774, 18 May 2023
Cited by: 1 article | PMID: 37200608 | PMCID: PMC10240488
Go to all (34) article citations
Data
Data behind the article
This data has been text mined from the article, or deposited into data resources.
BioStudies: supplemental material and supporting data
Protein structures in PDBe (2)
-
(1 citation)
PDBe - 2kjvView structure
-
(1 citation)
PDBe - 2lbaView structure
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
Quantitative determination of ribosome nascent chain stability.
Proc Natl Acad Sci U S A, 113(47):13402-13407, 07 Nov 2016
Cited by: 51 articles | PMID: 27821780 | PMCID: PMC5127326
A switch from α-helical to β-strand conformation during co-translational protein folding.
EMBO J, 41(4):e109175, 07 Jan 2022
Cited by: 16 articles | PMID: 34994471 | PMCID: PMC8844987
The ribosome destabilizes native and non-native structures in a nascent multidomain protein.
Protein Sci, 26(7):1439-1451, 19 May 2017
Cited by: 29 articles | PMID: 28474852 | PMCID: PMC5477528
Folding of a nascent peptide on the ribosome.
Prog Nucleic Acid Res Mol Biol, 66:41-66, 01 Jan 2001
Cited by: 61 articles | PMID: 11051761
Review
Funding
Funders who supported this work.
Cancerfonden (1)
Grant ID: 15 0888
Knut och Alice Wallenbergs Stiftelse (1)
Grant ID: KAW 2012.0282
Swedish Research Council (1)
Grant ID: 621-2014-3713