Europe PMC

This website requires cookies, and the limited processing of your personal data in order to function. By using the site you are agreeing to this as outlined in our privacy notice and cookie policy.

Abstract 


The spliceosome (SPL) is a majestic macromolecular machinery composed of five small nuclear RNAs and hundreds of proteins. SPL removes noncoding introns from precursor messenger RNAs (pre-mRNAs) and ligates coding exons, giving rise to functional mRNAs. Building on the first SPL structure solved at near-atomic-level resolution, here we elucidate the functional dynamics of the intron lariat spliceosome (ILS) complex through multi-microsecond-long molecular-dynamics simulations of ∼1,000,000 atoms models. The ILS essential dynamics unveils (i) the leading role of the Spp42 protein, which heads the gene maturation by tuning the motions of distinct SPL components, and (ii) the critical participation of the Cwf19 protein in displacing the intron lariat/U2 branch helix. These findings provide unprecedented details on the SPL functional dynamics, thus contributing to move a step forward toward a thorough understanding of eukaryotic pre-mRNA splicing.

Free full text 


Logo of pnasLink to Publisher's site
Proc Natl Acad Sci U S A. 2018 Jun 26; 115(26): 6584–6589.
Published online 2018 Jun 11. https://doi.org/10.1073/pnas.1802963115
PMCID: PMC6042132
PMID: 29891649

All-atom simulations disentangle the functional dynamics underlying gene maturation in the intron lariat spliceosome

Associated Data

Supplementary Materials

Significance

Precursor messenger RNA (pre-mRNA) splicing is a crucial step of gene expression, enabling the maturation of pre-mRNA transcripts into protein-coding mRNAs. In humans, a majestic ribonucleoprotein machinery—the spliceosome—governs this fundamental process, the defects and misregulation of which lead to over 200 human diseases. A thorough understanding of splicing is pivotal for biology and medicine, holding the promise of harnessing it for genome modulation applications. Despite the recent breakthroughs gained by cryo-EM, an atomic-resolution picture of the spliceosome functional plasticity is still missing. Here, all-atom simulations elucidate the cooperative motions underlying the functional dynamics of the spliceosome at a late stage of the splicing cycle, suggesting the role of specific proteins involved in the spliceosome disassembly from an atomic-level perspective.

Keywords: spliceosome, splicing, molecular dynamics, RNA, gene maturation

Abstract

The spliceosome (SPL) is a majestic macromolecular machinery composed of five small nuclear RNAs and hundreds of proteins. SPL removes noncoding introns from precursor messenger RNAs (pre-mRNAs) and ligates coding exons, giving rise to functional mRNAs. Building on the first SPL structure solved at near–atomic-level resolution, here we elucidate the functional dynamics of the intron lariat spliceosome (ILS) complex through multi-microsecond-long molecular-dynamics simulations of ~1,000,000 atoms models. The ILS essential dynamics unveils (i) the leading role of the Spp42 protein, which heads the gene maturation by tuning the motions of distinct SPL components, and (ii) the critical participation of the Cwf19 protein in displacing the intron lariat/U2 branch helix. These findings provide unprecedented details on the SPL functional dynamics, thus contributing to move a step forward toward a thorough understanding of eukaryotic pre-mRNA splicing.

Precursor messenger RNA (pre-mRNA) splicing is a process of paramount importance for living cells, spanning all domains of life. Splicing is a meticulous snip-and-stitch editing of primary RNA transcripts emerging from gene transcription, whereby the noncoding sections (introns) are removed, while the flanking coding regions (exons) are joined together to form functional mRNA filaments (14). Each splicing cycle entails two subsequent SN2 transesterification reactions facilitated by two Mg2+ ions, which enable a two-metal–aided catalysis (4). In eukaryotes, the tailor of pre-mRNA splicing is the spliceosome (SPL), a sophisticated multimegadalton ribonucleoprotein machinery composed of five small nuclear RNAs (snRNAs) (U1, U2, U4, U5, and U6) and hundreds of different proteins endowed with distinct functions (2). snRNAs associate with specific proteins to form the spliceosomal snRNPs subunits, which, together with other complexes like nineteen complex (NTC) and NTC-related (NTR), enzymes, and cofactors, tune the SPL catalytic, structural, and dynamical properties. In this entangled framework, U6 snRNA is directly in charge of catalysis (4), positioning the metals within a catalytic site strikingly similar to that of group II introns ribozymes (G2IRs) (47), the SPL evolutionary ancestors for which we have elucidated the catalytic mechanism (8, 9). SPL is assembled and disassembled at each splicing cycle and undergoes a continuous conformational and compositional remodeling aimed at processing the large and diverse RNA transcripts with single-nucleotide precision. Since genome fidelity is a key prerequisite for healthy cells, it is not surprising that more than 200 human diseases, among which are cancer and neurodegeneration, are associated with splicing defects and misregulations (10, 11). This highlights the urgent need of an in-depth knowledge of the SPL structural and functional cycle (11). High-resolution cryo-electron microscopy (cryo-EM) has prompted a transformative era in the SPL structural biology, providing groundbreaking information on its assembly, composition, and reactivity (3, 5, 6, 1224). However, the mechanistic understanding of this majestic machinery at atomic level of detail is far from being complete. In particular, the SPL conformational plasticity, its impact on splicing activity, and the role of many proteins engaged at different stages of splicing are still poorly understood (10, 12, 25). Among the recent SPL cryo-EM models (5, 6, 1123), here we focus on a structure solved at near-atomic resolution from the yeast Schizosaccharomyces pombe, representing the SPL at the latest stage of splicing, that is, the intron lariat spliceosome (ILS) complex (5, 6). In this study, we provide relevant insights on the SPL functional dynamics via multi-microsecond-long molecular-dynamics (MD) simulations of two large models built on this cryo-EM structure (5, 6) (Fig. 1). Our work unveils the central role of Spp42 (Prp8 in Saccharomyces cerevisiae), which appears to act as an orchestra conductor of the “gene maturation symphony” by tuning and governing the motions of different and specific SPL components. Remarkably, we show that, cooperatively with Spp42, the NTR Cwf19 protein marks the ILS disassembly by displacing the intron lariat (IL)/U2 snRNA double helix (also named branch helix). Intriguingly, our findings perfectly match with the stage of the splicing cycle explored here.

An external file that holds a picture, illustration, etc.
Object name is pnas.1802963115fig01.jpg

Spliceosomal (A) ILS-1 and (B) ILS-2 models built on the yeast S. pombe intron lariat spliceosome (ILS) cryo-EM structure (PDB entry 3JB9) (5, 6). Spp42 is shown as surface (cyan), while the other proteins and RNAs (snRNAs and IL) are depicted as cartoons using different colors. Mg2+ (orange) and Zn2+ (yellow) are shown as spheres, while GDP molecule is represented with atom-colored van der Waals spheres.

Results

SPL Dynamics.

Our MD simulations are based on the ILS complex solved from S. pombe at 3.6-Å average resolution and exceeding 3.2 Å in the core region (5, 6). Although this high-resolution cryo-EM map offers an exquisite structural reconstruction of the SPL, it must be noted that it is still at the limit of what can be used for atomic-level simulations. Two model systems were built, taking into account the central macromolecular constituents (i.e., proteins and RNAs) and ions of the Protein Data Bank (PDB) structure, and simulated in explicit water (SI Appendix, Fig. S1). These models, hereafter referred to as ILS-1 (Fig. 1A) and ILS-2 (Fig. 1B), count 721,089 and 914,099 atoms, respectively (for further details, see SI Appendix, Tables S1 and S2). Overall, three MD replicas of 750 ns for ILS-1 and one replica of 1 μs for ILS-2 resulted in a cumulative simulation time of 3.25 μs. Despite adopting the same simulation protocol, a longer sampling was allowed for the ILS-2 model. This was mostly a precautionary choice related to the larger size of this system (~200,000 atoms).

To date, these are the first SPL models investigated from an atomic-resolution perspective with force field (FF)-based MD simulations (26). From the inspection of root-mean-square deviation (rmsd) and radius of gyration (Rg), both models exhibit converged and comparable structural and dynamical profiles, which assess the reliability of the computational approach adopted to build the models (SI Appendix, Fig. S2). Moreover, our simulations reveal a stable four-Mg2+–ion catalytic site, showing the hallmark architecture also displayed by G2IRs in our previous studies (SI Appendix, Fig. S3) (8, 9). Notably, not only the U6 ligands coordinating the catalytic Mg2+ but also the characteristic triple-helix formed by U6 and U2 are conserved throughout the MD simulations (SI Appendix, Fig. S4). Correlation analyses, principal-component analysis (PCA), and electrostatic calculations were then applied to disentangle the cooperative motions at the basis of the SPL functional dynamics and to unravel their possible link with the electrostatic properties. In the following, we mostly discuss the results obtained out of three MD replicas of ILS-1. The MD simulation of ILS-2 is used as a further confirmation of the functional dynamics observed in ILS-1 and to check the impact of the proteins additionally included in this larger model.

The SPL: A Protein-Directed Ribozyme.

To characterize the SPL functional dynamics and the leading role of Spp42, the largest and the most important protein of the SPL, we have computed the cross-correlation scores (CSs) for each pair of SPL components (i.e., RNAs and proteins, explicitly considering the Spp42 domains and N-terminal motifs), which enable to qualitatively measure their interdependent dynamical coupling (2729). The CSs histogram for ILS-1 model (Fig. 2B) is generated from the per-residue Pearson’s coefficients (CCs) cross-correlation matrix (Fig. 2A) and provides a coarse and prompt picture of the most relevant linearly coupled motions occurring across the system during the MD simulations. Among others, the following crucial connections disclose important results: (i) all of the snRNAs (U2, U5, U6) and the IL are overall positively correlated with Spp42 (especially U5) and most of the other proteins (blue squares highlighted with colored boxes in Fig. 2B), thus indicating that the SPL acts as a protein-directed ribozyme; (ii) Spp42 (Fig. 2C) is highly correlated with specific proteins, therefore governing and tuning the SPL plasticity through its unique multibranched architecture. In particular, the closest dynamical ties are established with two proteins of U5 snRNP (Cwf10 and Cwf17) and with the NTC DExD/H-box Prp5 protein. Indeed, Cwf10, a GTPase involved in SPL remodeling (6), moves in lock-step with the “lasso,” a peculiar N-terminal extended loop of Spp42, whereas Cwf17 and Prp5 are strongly coupled with two protruding “arms” of Spp42, the N-terminal α-helices binding to Cwf17 (B-Cwf17), and to reverse-transcriptase (B-RT), respectively; (iii) finally, U2 and IL positively correlate with Cwf19 and Spp42, pointing to a cooperative functional role involving these two proteins and the branch helix.

An external file that holds a picture, illustration, etc.
Object name is pnas.1802963115fig02.jpg

Cooperative motions underlying the functional dynamics in the spliceosomal ILS-1 model system. (A) Per-residue Pearson’s coefficients (CCs) cross-correlation matrix derived from the mass-weighted covariance matrix constructed over the last 670 ns of MD simulations of ILS-1 (replica #1) for Cα and P atoms. CCs are comprised between −1 (anticorrelation, red) and +1 (correlation, blue). The domains of Cwf10 highlighted in the matrix are (from Left to Right): N-t and G; proteins of Sm-ring are (from Left to Right): D3, B, D1, D2, E, F, and G. (B) Histogram reporting the (per-column) normalized correlation scores (CSs) calculated for each pair of spliceosomal components. Each column reports the CSs calculated between a specific component and all of the others (listed on rows). Spp42 domains and N-terminal motifs are distinctly considered. CS value ranges from −1 (anticorrelated motions) to +1 (lock-step motions). The most relevant correlations are highlighted using colored boxes. (C) Close-up of Spp42 (surface representation) in ILS-1, with the four domains shown/listed with different colors. The five N-terminal motifs are numbered and listed. Abbreviations: B-Cwf17/RT, binding to Cwf17/RT; I. Lariat, intron lariat; RT-f/p, reverse-transcriptase finger/palm; Thumb, Thumb/X.

The IL/U2 Helix Displacement.

Strikingly, behind the intriguing coupling discussed above in iii, the essential dynamics extracted from the PCA unveiled a displacement of the IL/U2 helix, whose unrolling is copromoted by Cwf19 and Spp42 (Fig. 3 and Movie S1). This original finding sheds lights on the so-far-unclear functional annotation of Cwf19 and is fully consistent with the late stage of the splicing cycle under study. In the ILS complex, in fact, the IL/U2 branch helix, formed before the catalysis to bulge out the branch point (BP) adenosine, must be displaced and unwound before IL linearization (i.e., debranching) and degradation (3033).

An external file that holds a picture, illustration, etc.
Object name is pnas.1802963115fig03.jpg

Displacement (highlighted by a black arrow) of the intron lariat (IL)/U2 helix, which unrolls away from the SPL, as revealed by the PCA of MD trajectories. Arrows, colored by protein/RNA, show the motion of Cα/P atoms along the first eigenvector. Cwf19 (green) and Spp42 (cyan) are depicted as cartoon; U2 (orange) and IL (yellow) are shown as tube; U6 (blue) and the other proteins are represented as cartoons. Mg2+ (orange) and Zn2+ (yellow) are shown as spheres.

Interestingly, electrostatic calculations performed on the cryo-EM structure and on representative MD snapshots reveal the presence of a positively charged cavity, rich in arginines and lysines, exposed by the RT domain of Spp42 and by Cwf19 (SI Appendix, Fig. S5). This pocket acts as an electrostatic trap for the negatively charged nucleotides of U2 and IL. The gradual IL/U2 displacement (Fig. 4) appears to be triggered by two key events: (i) the electrostatic recruitment of IL/U2 in the vicinity of the positive pocket; (ii) the electrostatic lock of the BP region, that is, the BP adenosine (A501), and the flanking G100 and C502 nucleotides. This lock is modulated by K364, K366, and R388 of Cwf19, which form a polar tweezers stably anchoring the BP region (Fig. 5). The 3′-terminus of the IL is firmly embedded in the positive pocket, thus favoring the 5′-downstream unrolling of the branch helix. The positive clamp surrounding the BP region constitutes the pivot of the IL/U2 unrolling (Fig. 5), which, assisted by the concomitant action of Cwf19 and Spp42, promotes the IL/U2 displacement. Remarkably, the IL/U2 helix is truncated in the cryo-EM structure (SI Appendix, Fig. S1) (5, 6). This, consistently with our results, may be related to a significant mobility of IL/U2, which most likely hampered its complete reconstruction.

An external file that holds a picture, illustration, etc.
Object name is pnas.1802963115fig04.jpg

The electrostatic origin of the branch helix displacement. Snapshots from MD simulations showing the gradual displacement of the intron lariat (IL)/U2 helix. (Left) Proteins are shown with electrostatic surface (blue/red colors for positive/negative charges, respectively). (Right) Cwf19 (green) and Spp42 (cyan) are represented with surface; U2 (orange), IL (yellow), and U6 (blue) are shown as cartoon. (A) The positively charged pocket, formed by Cwf19 and Spp42 recruits IL/U2. (B) IL/U2 interacts with this pocket and starts unrolling. (C) IL/U2 displacement due to the action of Cwf19 and Spp42.

An external file that holds a picture, illustration, etc.
Object name is pnas.1802963115fig05.jpg

The polar tweezers formed by K364, K366, and R388 of Cwf19. Cwf19 and Spp42 are shown with the electrostatic surface (blue/red colors for positive/negative charges, respectively). U2 (orange), U6 (blue), and IL (yellow) are depicted as cartoons. IL A501 (spheres) and 3′-termini (licorice) are colored by atoms.

The Enlarged (ILS-2) Model.

Here, results from the MD simulation of the enlarged ILS-2 model (Fig. 1B) are compared with those of ILS-1. The PCA scatterplot shows that the conformational space explored along the first two PCs is similar in the two systems (SI Appendix, Fig. S6). A narrower distribution of points is observed for ILS-2, with the contribution of the first three PCs to the overall SPL motion being roughly equivalent to that of PC1-only in ILS-1 (SI Appendix, Fig. S7). The contraction of the conformational space explored by ILS-2 can be ascribed to the proteins (Prp45 and Prp17) and domains (of Spp42 and Cwf10) additionally included in this model. Prp45 in particular appears to stabilize the overall SPL structure, damping and softening the functional motions of ILS-2 (at least within the timescale investigated). Despite this, the coarse CSs histogram (Fig. 6B) obtained from the per-residue cross-correlation matrix (Fig. 6A) confirms the central role of Spp42 (Fig. 6C) in directing the motions of distinct SPL components as observed in ILS-1. Interestingly, in ILS-2, the IL/U2 displacement is shown by PC2 and PC3 (Fig. 7), which display an increased relative contribution to the total variance with respect to that observed in the ILS-1 model (SI Appendix, Fig. S7). Finally, the MD simulation of ILS-2 further assesses the critical role exerted by the polar tweezers, which forms the hinge of the IL/U2 unrolling by tightening the BP region (SI Appendix, Fig. S8). Here, in addition to K364, K366, and R388, also the R385 residue exposed by Cwf19 contributes to lock the BP region.

An external file that holds a picture, illustration, etc.
Object name is pnas.1802963115fig06.jpg

Cooperative motions underlying the functional dynamics in the spliceosomal ILS-2 model system. (A) Per-residue Pearson’s coefficients (CCs) cross-correlation matrix derived from the mass-weighted covariance matrix constructed over the last 920 ns of MD simulations of ILS-2 (replica #1) for Cα and P atoms. CCs are comprised between −1 (anticorrelation, red) and +1 (correlation, blue). The domains of Cwf10 highlighted in the matrix are (from Left to Right): N-t, G, D-II, D-III, D-IV, D-V, and C-t; proteins of Sm-ring are (from Left to Right): D3, B, D1, D2, E, F, and G. (B) Histogram reporting the (per-column) normalized correlation scores (CSs) calculated for each pair of spliceosomal components. Each column reports the CSs calculated between a specific component and all of the others (listed on rows). Spp42 domains and N-terminal motifs are distinctly considered. CS value ranges from −1 (anti-correlated motions) to +1 (lock-step motions). The most relevant correlations are highlighted using colored boxes. (C) Close-up of Spp42 (surface representation) in ILS-2, with the six domains shown/listed with different colors. The five N-terminal motifs are numbered and listed. Abbreviations: B-Cwf17/RT, binding to Cwf17/RT; Endo., endonuclease; I. Lariat, intron lariat; RNase, RNase H-like; RT-f/p, reverse-transcriptase finger/palm; Thumb, Thumb/X.

An external file that holds a picture, illustration, etc.
Object name is pnas.1802963115fig07.jpg

Displacement (highlighted with a black arrow) of the intron lariat (IL)/U2 branch helix, which unrolls away from the SPL, as revealed by the PCA of MD trajectory of ILS-2 model. Arrows, colored according to the protein/RNA filament to which they belong, show the motion of Cα/P atoms along the second eigenvector (PC2, thicker arrows) and third eigenvector (PC3, thinner arrows). Cwf19 (green) and Spp42 (cyan) are depicted as cartoon; U2 (orange) and IL (yellow) are shown as tube; U6 (blue) and the other proteins are represented as cartoons. Mg2+ (orange) and Zn2+ (yellow) are shown as spheres.

Discussion

The SPL is an extremely dynamic machinery, undergoing continuous conformational and compositional changes. Although recent cryo-EM experiments have provided significant insights on this sophisticated macromolecule, the SPL structural/functional plasticity still calls for an in-depth understanding, aimed at better dissecting the specific role of the different proteins recruited along the splicing cycle. In the present study, we disentangle the most important cooperative motions underlying the functional dynamics of the ILS complex from an unprecedented atomic-resolution perspective. As major results, our work elucidates the leading role of the Spp42 protein and supplies key hints for a functional annotation of the NTR Cwf19 protein.

As highlighted by correlation and PCA analyses, our MD simulations corroborate that the SPL acts as a protein-directed ribozyme. Indeed, the three snRNAs of the ILS complex, all included in our models, are extremely tied with the proteins (Spp42 in particular) in their dynamical evolution, suggesting that their motions are directed by the protein scaffold (Figs. 2B and and6B).6B). Significantly, this unique trait of the SPL, so far highlighted by structural considerations based on the cryo-EM models solved in the last years (12, 25), is now sustained also by our simulations. In this scenario, the large Spp42 (Prp8 in S. cerevisiae) protein (Figs. 2C and and6C),6C), which exhibits a peculiar structural organization made of seven distinct domains (i.e., N-terminal, RT finger/palm, thumb/X, linker, endonuclease-like, RNase H-like, Jab1/MPN), is the central hub of the SPL (14, 25). Despite the dramatic compositional and conformational remodeling observed along the splicing cycle, the cryo-EM models of the SPL (mostly from S. cerevisiae) have pictured that Prp8 (Spp42) constitutes a rigid scaffold from the Bact to the ILS complex, with the exception of its highly mobile RNase H-like domain and some peculiar structural motifs and loops (25). Apparently, small conformational changes involving these local structural elements and RNase H-like domain of Prp8 (Spp42) may contribute to finely control the progression along the SPL cycle (25). Remarkably, these experimental observations are fully consistent with the functional dynamics of Spp42 captured from our simulations, showing that Spp42 heads the gene maturation in the ILS by modulating the motions of specific proteins through its sprawling multibranched architecture. The leading role of Spp42 is particularly emphasized by the plasticity of its N-terminal domain, which bears peculiar structural motifs such as the lasso loop or α-helices acting as protruding arms. Indeed, the CSs histograms calculated for ILS-1 (Fig. 2B) and ILS-2 model (Fig. 6B) reveal that, through these N-terminal motifs/loops, Spp42 establishes a close dynamical connection with specific proteins, among which are Cwf10, Cwf17, and Prp5. These couplings are remarkably evident and conserved in both ILS-1 and ILS-2 models. Since it has been hypothesized that the stage-specific splicing factors and enzymes are recruited onto a generally similar SPL scaffold (12, 25), it is tempting to suggest that the functional dynamics of Spp42 (Prp8) elucidated here may be shared also by other SPL complexes assembled along the cycle.

Intriguingly, the fine modulation of ILS cooperative motions occurs while preserving a stable catalytic site placed at the core of the SPL and composed of four Mg2+ ions coordinated by U6 snRNA phosphate ligands (14). This intricate RNA-based active site has been extraordinarily conserved across evolution from bacteria to humans for catalyzing the pre-mRNA splicing reactions (14). However, at variance with G2IRs, where the active site is stabilized by a network of RNA/RNA interactions, in the SPL the catalytic center is embedded in a protein cavity formed by Spp42 (Prp8) (25). In line with these experimental observations, our simulations confirm the stability and the integrity of the catalytic site even at the late stage of splicing studied here (ILS), further assessing the reliability of the adopted simulation protocol.

Of utmost importance is that the cryo-EM structure investigated here provides structural information of Cwf19 (5, 6), whose function has remained so far obscure (32). The S. pombe Cwf19 (CWF19L2 in humans) (19) is the paralog of the S. cerevisiae Drn1, as they show only a conserved C-t domain (32). In S. cerevisiae, Drn1 enhances the activity of its homolog Dbr1 (32), the enzyme that linearizes the lariats by cleaving the 2′,5′-phosphodiester linkage (33). Garrey et al. (32) hypothesized that Cwf19 (as well as CWF19L2 in humans), lacking a complete evolutionary homology with Drn1, may have developed supplemental functions that go beyond the regulation of Dbr1 debranching activity. In this respect, our results appear to foster this theory, suggesting that Cwf19 is effectively involved in the displacement of the branch helix (Figs. 3, ,4,4, and and7),7), a critical event preceding (and necessary for) IL debranching. As a final remark, the electrostatics, which plays a fundamental role in the SPL for the recruitment and the placement of the snRNAs, as well as for the formation of the catalytic cavity (5, 6, 25), is here suggested to be also at the origin of this functional motion affecting the ILS disassembly (Fig. 5 and SI Appendix, Figs. S5 and S8).

In summary, multi-microsecond-long all-atom MD simulations of the heart of the ILS complex provide unprecedented insights on the SPL functional plasticity. This work repaints the key role of Spp42 as that of an orchestra conductor of a gene maturation symphony by strikingly revealing its fine modulation of the functional motions of many distinct SPL components. Among those, the essential dynamics analysis has unveiled an electrostatically driven unrolling and displacement of the branch helix copromoted by Cwf19 and Spp42. This finding provides evidences for a functional involvement of Cwf19 in the ILS disassembly. A detailed comprehension of the eukaryotic splicing has potential breakthrough implications for revolutionary gene modulation therapies to fight complex diseases. In this scenario, our work represents an unprecedented attempt to characterize the SPL functional dynamics with MD simulations and dispenses a noteworthy piece of knowledge for a thorough mechanistic understanding of this fundamental step of gene expression.

Materials and Methods

MD simulations were based on the S. pombe SPL reconstructed with cryo-EM at the average resolution of 3.6 Å (PDB entry 3JB9) (5, 6). The ILS-1 and ILS-2 models were built on the core region of this structure, representing the most important and conserved part of the SPL throughout the splicing cycle. De novo model building, as implemented in Modeler 9, version 16 (34), was used to reconstruct short missing loops where necessary. MD simulations were performed with Gromacs 5 (35) software package. The AMBER-ff12SB FF was adopted for proteins (36), while the ff99+bsc0+χOL3 FF was used for RNAs (37, 38), since these are the most validated and recommended FFs for protein/RNA systems (26). Mg2+ ions were described with the nonbonded fixed point charge FF due to Åqvist (39) as it was shown to properly describe binuclear sites (9). Na+ ions parameters were taken from Joung and Cheatham (40), while Zn2+ ions were modeled with the cationic dummy atoms approach developed by Pang (41). A slow and careful equilibration protocol was adopted before the productive phase as suggested by Šponer et al. (26). The trajectories were inspected and visualized with VMD software (42). All of the analyses, including rmsd, Rg, PCA, and the calculation of the cross-correlation matrices, were done with cpptraj module of Ambertools 16 (43) and with Gromacs 5 (35) suite. The correlation scores (CSs) were derived from the cross-correlation matrices (Figs. 2A and and6A6A and SI Appendix, Figs. S9 and S10), following the approach adopted in previous studies (2729). Electrostatic calculations were performed on the proteins included in ILS-1 and ILS-2 with the adaptive Poisson-Boltzmann solver (APBS 1.4) software (44). A detailed description of the structural models (ILS-1 and ILS-2), systems setup, MD simulations, PCA, correlation analysis, and electrostatic calculations are provided in SI Appendix.

Supplementary Material

Supplementary File

Supplementary File

Acknowledgments

We are grateful for Italian SuperComputing Resource Allocation Grant HP10B1YXX3 for computational resources. A.M. thanks Italian Association for Cancer Research for financial support (MFAG 17134).

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at www.pnas.org/lookup/suppl/10.1073/pnas.1802963115/-/DCSupplemental.

References

1. Papasaikas P, Valcárcel J. The spliceosome: The ultimate RNA chaperone and sculptor. Trends Biochem Sci. 2016;41:33–45. [Abstract] [Google Scholar]
2. Hoskins AA, Moore MJ. The spliceosome: A flexible, reversible macromolecular machine. Trends Biochem Sci. 2012;37:179–188. [Europe PMC free article] [Abstract] [Google Scholar]
3. Matera AG, Wang Z. A day in the life of the spliceosome. Nat Rev Mol Cell Biol. 2014;15:108–121. [Europe PMC free article] [Abstract] [Google Scholar]
4. Fica SM, et al. RNA catalyses nuclear pre-mRNA splicing. Nature. 2013;503:229–234. [Europe PMC free article] [Abstract] [Google Scholar]
5. Hang J, Wan R, Yan C, Shi Y. Structural basis of pre-mRNA splicing. Science. 2015;349:1191–1198. [Abstract] [Google Scholar]
6. Yan C, et al. Structure of a yeast spliceosome at 3.6-angstrom resolution. Science. 2015;349:1182–1191. [Abstract] [Google Scholar]
7. Steitz TA, Steitz JA. A general two-metal-ion mechanism for catalytic RNA. Proc Natl Acad Sci USA. 1993;90:6498–6502. [Europe PMC free article] [Abstract] [Google Scholar]
8. Casalino L, Palermo G, Rothlisberger U, Magistrato A. Who activates the nucleophile in ribozyme catalysis? An answer from the splicing mechanism of group II introns. J Am Chem Soc. 2016;138:10374–10377. [Abstract] [Google Scholar]
9. Casalino L, Palermo G, Abdurakhmonova N, Rothlisberger U, Magistrato A. Development of site-specific Mg2+-RNA force field parameters: A dream or reality? Guidelines from combined molecular dynamics and quantum mechanics simulations. J Chem Theory Comput. 2017;13:340–352. [Abstract] [Google Scholar]
10. Shi Y. Mechanistic insights into precursor messenger RNA splicing by the spliceosome. Nat Rev Mol Cell Biol. 2017;18:655–670. [Abstract] [Google Scholar]
11. Lee SCW, Abdel-Wahab O. Therapeutic targeting of splicing in cancer. Nat Med. 2016;22:976–986. [Europe PMC free article] [Abstract] [Google Scholar]
12. Fica SM, Nagai K. Cryo-electron microscopy snapshots of the spliceosome: Structural insights into a dynamic ribonucleoprotein machine. Nat Struct Mol Biol. 2017;24:791–799. [Europe PMC free article] [Abstract] [Google Scholar]
13. Cate JH. STRUCTURE. A Big Bang in spliceosome structural biology. Science. 2016;351:1390–1392. [Abstract] [Google Scholar]
14. Wan R, Yan C, Bai R, Huang G, Shi Y. Structure of a yeast catalytic step I spliceosome at 3.4 Å resolution. Science. 2016;353:895–904. [Abstract] [Google Scholar]
15. Wan R, et al. The 3.8 Å structure of the U4/U6.U5 tri-snRNP: Insights into spliceosome assembly and catalysis. Science. 2016;351:466–475. [Abstract] [Google Scholar]
16. Yan C, Wan R, Bai R, Huang G, Shi Y. Structure of a yeast activated spliceosome at 3.5 Å resolution. Science. 2016;353:904–911. [Abstract] [Google Scholar]
17. Yan C, Wan R, Bai R, Huang G, Shi Y. Structure of a yeast step II catalytically activated spliceosome. Science. 2017;355:149–155. [Abstract] [Google Scholar]
18. Galej WP, et al. Cryo-EM structure of the spliceosome immediately after branching. Nature. 2016;537:197–201. [Europe PMC free article] [Abstract] [Google Scholar]
19. Nguyen TH, et al. CryoEM structures of two spliceosomal complexes: Starter and dessert at the spliceosome feast. Curr Opin Struct Biol. 2016;36:48–57. [Europe PMC free article] [Abstract] [Google Scholar]
20. Nguyen THD, et al. Cryo-EM structure of the yeast U4/U6.U5 tri-snRNP at 3.7 Å resolution. Nature. 2016;530:298–302. [Europe PMC free article] [Abstract] [Google Scholar]
21. Zhang X, et al. An atomic structure of the human spliceosome. Cell. 2017;169:918–929.e14. [Abstract] [Google Scholar]
22. Bertram K, et al. Cryo-EM structure of a human spliceosome activated for step 2 of splicing. Nature. 2017;542:318–323. [Abstract] [Google Scholar]
23. Wan R, Yan C, Bai R, Lei J, Shi Y. Structure of an intron lariat spliceosome from Saccharomyces cerevisiae. Cell. 2017;171:120–132.e12. [Abstract] [Google Scholar]
24. Fica SM, et al. Structure of a spliceosome remodelled for exon ligation. Nature. 2017;542:377–380. [Europe PMC free article] [Abstract] [Google Scholar]
25. Shi Y. The spliceosome: A protein-directed metalloribozyme. J Mol Biol. 2017;429:2640–2653. [Abstract] [Google Scholar]
26. Šponer J, et al. How to understand atomistic molecular dynamics simulations of RNA and protein-RNA complexes? Wiley Interdiscip Rev RNA. 2017;8:e1405. [Abstract] [Google Scholar]
27. Palermo G, et al. Protospacer adjacent motif-induced allostery activates CRISPR-Cas9. J Am Chem Soc. 2017;139:16028–16031. [Europe PMC free article] [Abstract] [Google Scholar]
28. Palermo G, Miao Y, Walker RC, Jinek M, McCammon JA. Striking plasticity of CRISPR-Cas9 and key role of non-target DNA, as revealed by molecular simulations. ACS Cent Sci. 2016;2:756–763. [Europe PMC free article] [Abstract] [Google Scholar]
29. Pavlin M, et al. A computational assay of estrogen receptor α antagonists reveals the key common structural traits of drugs effectively fighting refractory breast cancers. Sci Rep. 2018;8:649. [Europe PMC free article] [Abstract] [Google Scholar]
30. Yoshimoto R, Kataoka N, Okawa K, Ohno M. Isolation and characterization of post-splicing lariat-intron complexes. Nucleic Acids Res. 2009;37:891–902. [Europe PMC free article] [Abstract] [Google Scholar]
31. Martin A, Schneider S, Schwer B. Prp43 is an essential RNA-dependent ATPase required for release of lariat-intron from the spliceosome. J Biol Chem. 2002;277:17743–17750. [Abstract] [Google Scholar]
32. Garrey SM, et al. A homolog of lariat-debranching enzyme modulates turnover of branched RNA. RNA. 2014;20:1337–1348. [Europe PMC free article] [Abstract] [Google Scholar]
33. Khalid MF, Damha MJ, Shuman S, Schwer B. Structure-function analysis of yeast RNA debranching enzyme (Dbr1), a manganese-dependent phosphodiesterase. Nucleic Acids Res. 2005;33:6349–6360. [Europe PMC free article] [Abstract] [Google Scholar]
34. Sali A, Blundell TL. Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993;234:779–815. [Abstract] [Google Scholar]
35. Van Der Spoel D, et al. GROMACS: Fast, flexible, and free. J Comput Chem. 2005;26:1701–1718. [Abstract] [Google Scholar]
36. Maier JA, et al. ff14SB: Improving the accuracy of protein side chain and backbone parameters from ff99SB. J Chem Theory Comput. 2015;11:3696–3713. [Europe PMC free article] [Abstract] [Google Scholar]
37. Pérez A, et al. Refinement of the AMBER force field for nucleic acids: Improving the description of alpha/gamma conformers. Biophys J. 2007;92:3817–3829. [Europe PMC free article] [Abstract] [Google Scholar]
38. Zgarbová M, et al. Refinement of the Cornell et al. nucleic acids force field based on reference quantum chemical calculations of glycosidic torsion profiles. J Chem Theory Comput. 2011;7:2886–2902. [Europe PMC free article] [Abstract] [Google Scholar]
39. Aqvist J. Ion water interaction potentials derived from free-energy perturbation simulations. J Phys Chem. 1990;94:8021–8024. [Google Scholar]
40. Joung IS, Cheatham TE., 3rd Determination of alkali and halide monovalent ion parameters for use in explicitly solvated biomolecular simulations. J Phys Chem B. 2008;112:9020–9041. [Europe PMC free article] [Abstract] [Google Scholar]
41. Pang YP. Novel zinc protein molecular dynamics simulations: Steps toward antiangiogenesis for cancer treatment. J Mol Model. 1999;5:196–202. [Google Scholar]
42. Humphrey W, Dalke A, Schulten K. VMD: Visual molecular dynamics. J Mol Graph. 1996;14:33–38, 27–28. [Abstract] [Google Scholar]
43. Case DA, et al. 2016. AMBER 2016 (University of California, San Francisco)
44. Baker NA, Sept D, Joseph S, Holst MJ, McCammon JA. Electrostatics of nanosystems: Application to microtubules and the ribosome. Proc Natl Acad Sci USA. 2001;98:10037–10041. [Europe PMC free article] [Abstract] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

Citations & impact 


Impact metrics

Jump to Citations

Citations of article over time

Alternative metrics

Altmetric item for https://www.altmetric.com/details/43604758
Altmetric
Discover the attention surrounding your research
https://www.altmetric.com/details/43604758

Smart citations by scite.ai
Smart citations by scite.ai include citation statements extracted from the full text of the citing article. The number of the statements may be higher than the number of citations provided by EuropePMC if one paper cites another multiple times or lower if scite has not yet processed some of the citing articles.
Explore citation contexts and check if this article has been supported or disputed.
https://scite.ai/reports/10.1073/pnas.1802963115

Supporting
Mentioning
Contrasting
4
106
0

Article citations


Go to all (33) article citations

Data 


Data behind the article

This data has been text mined from the article, or deposited into data resources.

Similar Articles 


To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.