Abstract
Free full text
Karyopherin-mediated nucleocytoplasmic transport
Abstract
Efficient and regulated nucleocytoplasmic trafficking of macromolecules to the correct subcellular compartment is critical for proper functions of the eukaryotic cell. The majority of the macromolecular traffic across the nuclear pores is mediated by the Karyopherin-β (or Kap) family of nuclear transport receptors. Work over more than two decades has shed considerable light on how the different Kap family members bring their respective cargoes into the nucleus or the cytoplasm in efficient and highly regulated manners. In this Review, we overview the main features and established functions of Kap family members, describe how Kaps recognize their cargoes and discuss the different ways in which these Kap–cargo interactions can be regulated, highlighting new findings and open questions. We also describe current knowledge of the import and export of the components of three large gene expression machines — the core replisome, RNA polymerase II and the ribosome — pointing out the questions that persist about how such large macromolecular complexes are trafficked to serve their function in a designated subcellular location.
The nucleus is bound by the double-membraned nuclear envelope, which separates the genome from the rest of the eukaryotic cell. Nuclear pore complexes (NPCs) span the nuclear envelope, serving both as the main conduit for molecules between the nucleus and cytoplasm and as a permeability barrier to limit the passage of macromolecules (Supplementary Box 1) and to ensure the maintenance of nuclear composition. The conserved Karyopherin-β (Kap) family of nuclear transport receptors mediates the majority of transport of macromolecules, especially of proteins, across the NPC into the nucleus (importins), out of the nucleus (exportins) or in both directions (biportins)1–3. The exceptions of a few Kap-independent transport pathways for a few specific classes of macromolecules are described in Supplementary Box 2. Although the exact numbers of Kap family members vary in different organisms, most are identifiable and conserved across eukaryotes, suggesting a conservation of function throughout the eukaryotic kingdom4 (Supplementary Box 3). Of the 20 Kaps identified in human cells, 10 are importins, 5 are exportins, 3 are biportins and the functions of 2 Kaps remain unknown (TABLE 1). Kaps are critically involved in many eukaryotic cellular processes that include response to the environment, signal transduction, regulation and maintenance of the cell cycle. It is therefore not surprising that disruptions of Kap functions contribute to diseases such as neurodegeneration, cancer, inflammation and viral infections, making these transport receptors promising therapeutic targets (BOX 1).
Table 1 |
Kap | Other names | Isoformsa | Yeast homologue | Modes of cargo bindingb — consensus sequence (binding affinity/dissociation constants) | Functions of putative cargoesc | 59 ptRAN–GTP binding affinity |
---|---|---|---|---|---|---|
Importins | ||||||
Importin-β (IMPβ) | p97, Kapβ1, KPNB1, IPO1, IPOB, IMB1, NTF97 | 2 | KAP95 | cNLS via IMPα, with monopartite (K-K/R-X-K/R) or bipartite (K/R-K/R-X10–12-K/R3/5) sequences (single-digit nanomolar to micromolar21,27,221,222) | Broad functions in gene expression and cell cycle regulation9, inner nuclear membrane proteins, spliceosomal small nuclear ribonucleoproteins via SNUPN | Hundreds of picomolars95 |
IBB of adaptors (~2 nM1) | ||||||
Diverse folded domains (~2–200 nM1) | ||||||
Transportin/Karyopherin-β 2 (KAPβ2)d | Impβ2, KPNB2, IPO2, TNPO1, TRN1 | 3 | KAP104 | PY-NLS — hydrophobic/basic and R/K/H-X2–5-P-Y (single-digit to tens of nanomolars30–32,223) | mRNA processing9 | Hundreds of picomolars95 |
RGG regions (hundreds of nanomolars49,51) | ||||||
Transportin 2/Karyopherin-β2b (KAPβ2b)d | Impβ2b, KPNB2B, IPO3, TNPO2, TRN2 | 2 | NA | PY-NLS | NA | Unknown |
Transportin 3/Transportin-SR (TNPO3) | TrnSR, TrnSR2, Trn3, Imp12, IPO12 | 4 | MTR10 | RS-NLS — phospho-RS or RD/RE dipeptide repeats (hundreds of nanomolars to low micromolar36,51) | mRNA splicing and export9 | Hundreds of nanomolars95 |
Importin 4 (IPO4) | RanBP4, Imp4, Imp4b | 2 | YRB4/KAP123 | Linear segments | Chromatin organization9, vesicular transport and cytoskeleton organization8, ribosome biogenesis (yeast)197 | Tens of nanomolars95 |
Diverse folded domainsb | ||||||
Importin 5 (IPO5)d | Impβ3, Kapβ3, RanBP5, KPNB3, Imp5, IMB3 | 3 | PSE1/KAP121 | IK-NLS — K-V/I-X-K-X1–2-K/H/R (tens of nanomolars33) | Chromatin organization, ribosome biogenesis, cytokinesis9, vesicular transport and cytoskeleton organization8 | Hundreds of picomolars95 |
Diverse folded domainsb | ||||||
Importin 7 (IPO7)d | RanBP7, IPO7 | 1 | NMD5/KAP119 | Diverse folded domainsb | Cell cycle regulation, ribosome biogenesis9 | Single-digit to tens of nanomolars95 |
Importin 8 (IPO8)d | RanBP8, Imp8 | 2 | SXM1/KAP108 | Diverse folded domainsb | Ribosome biogenesis8,9 | Single-digit to tens of nanomolars95 |
Importin 9 (IPO9) | RanBP9, Imp9 | 1 | KAP114 | Diverse folded domains (low nanomolar59) | Nucleosome organization, ribosome biogenesis9 | Tens of nanomolars95 |
Importin 11 (IPO11) | RanBP11, Imp11 | 2 | KAP120 | Diverse folded domainsb | Developmental processes9 | Hundreds of nanomolars95 |
Exportins | ||||||
Chromatin Region Maintenance 1/Exportin 1 (CRM1/XPO1) | Exp1, emb | 1 | CRM1 | NES, class 1a–1d: Φ-X2–3-Φ-X2–3-Φ-X-Φ, class 2: Φ-X-Φ-X2-Φ-X-Φ, class 3: Φ-X2-Φ-X3-Φ-X2-Φ, class 4: Φ-X3-Φ-X2-Φ-X3-Φ, class 1a-R–1d-R: Φ-X-Φ-X2–3-Φ-X2–3-Φ (tens of nanomolars to tens of micromolars112) | Broadly functioning protein cargoes11, and various RNA via adaptors | Undetectable95 |
Cellular Apoptosis Susceptibility/Exportin 2 (CAS/XPO2) | Exp2, CSE1L, Impα re-exporter | 4 | CSE1 | Autoinhibited and folded Impα (~1 nM224) | IMPα and, possibly, more proteins8 | Undetectable95 |
Exportin-t (XPOT) | Exp-T, XPO3 | 1 | LOS1 | RNA structures (~3 nM225) | tRNA and other RNAs77 | Undetectable95 |
Exportin 5 (XPO5) | Exp5, RanBP21 | 1 | MSN5 | RNA structures | Pre-miRNA and other RNAs77,156 | Tens of nanomolars95 |
Undefined phospho-NESb | ||||||
Exportin 6 (XPO6) | Exp6, RanBP20 | 2 | NA | Undefined folded domainsb | Actin–profilin226 | Unknown |
Biportins | ||||||
Importin 13 (IPO13) | Kap13, RanBP13, IPO13 | 1 | PDR6/KAP122 | Diverse folded domains (import cargo: hundreds of nanomolars; export cargo: single-digit micromolar68) | Transcription factors and RNA binding proteins9 | Tens to hundreds of nanomolars68,95 |
Exportin 4 (XPO4) | Exp4 | 1 | NA | Diverse folded domains (export cargo: ~2 nM71) | RNA processing, ribosome biogenesis9 | ~40 nM71 |
Exportin 7 (XPO7)d | Exp7, RanBP16 | 1 | NA | Diverse folded domainsb | Broadly functioning protein cargoes11 | Unknown |
Unknown function | ||||||
RanBP6d | NA | 1 | NA | NA | NA | NA |
RanBP17d | NA | 2 | NA | NA | NA | NA |
cNLS, classical nuclear localization signal; IBB, amino-terminal IMPβ-binding; IK-NLS, isoleucine-lysine nuclear localization signal; NA, not available; NES, nuclear export signal; PY-NLS, proline-tyrosine nuclear localization signal; RS-NLS, arginine-serine repeat nuclear localization signal; Φ, hydrophobic residue; SNUPN, Snurportin-1; X, any amino acid.
For a Kap to transport its cargoes between the nucleus and the cytoplasm in the correct direction, it must first bind the cargo in the originating compartment (FIG. 1a). Cargoes use linear elements known as nuclear localization or export signals (NLSs or NESs) and/or folded domains to bind specific Kaps. The Kap–cargo complex then passes through the NPC by binding the phenylalanine-glycine (FG) repeats of nucleoporins located in the central pore of the NPC (so-called FG Nups) (Supplementary Box 1 and Supplementary Box 3). Finally, the cargo must be released from the Kap in the destination compartment.
The directionality of Kap-mediated transport is controlled by the RAN GTPase3 (FIG. 1a). RAN–GTP and RAN–GDP are asymmetrically localized in the nucleus and cytoplasm, respectively. Nuclear RAN is mostly bound to GTP because its guanine nucleotide exchange factor RCC1 is bound to chromatin, whereas cytoplasmic RAN GTPase-activating protein (RANGAP) and RAN-binding protein RANBP1 activate GTP hydrolysis to keep cytosolic RAN in the GDP-bound state. Kaps bind import cargoes and RAN–GTP in a mutually exclusive manner but bind export cargoes and RAN–GTP cooperatively. Thus, importins and biportins can bind import cargoes in the cytoplasm and translocate across NPCs to the nucleus where they bind RAN–GTP and release the cargoes. Conversely, exportins and biportins form ternary complexes with export cargoes and RAN–GTP in the nucleus, which translocate across NPCs to the cytosol where they disassemble upon GTP hydrolysis.
This Review describes the many ways Kaps recognize cargoes and how Kap–cargo interactions are modulated. We also cover knowledge about the trafficking of three gene expression machineries, one each involved in replication, transcription and translation, as case studies to showcase our current, albeit still limited, understanding of how they enter/exit the nucleus and how these processes are coordinated with their assembly/disassembly and functions.
Traffic in and out of nucleus
The organization of nucleocytoplasmic traffic mediated by different members of the Kap family and the links to biological function for individual Kaps are evolving topics of study and discussion. Dozens to hundreds of cargoes were biochemically identified to be transported by well-studied Kaps such as IMPβ (via cargo adaptor IMPα), KAPβ2 and CRM1, but such lists for other Kaps were much smaller1. Fortunately, several recent proteomics studies expanded the lists of candidate cargoes for many importins, a few exportins and a few biportins; it is important to note, however, that most of the interactions of these cargoes with the Kaps have not been individually verified as yet5–12. Similar numbers of candidate import cargoes were identified for each Kap; one study found 49–66 import cargoes for each of the 10 human importins and 2 biportins, whereas another listed ~400–800 interaction partners (direct and indirect) for each of the 6 importins and 2 biportins studied8,9. This relatively even distribution of import traffic amongst Kaps is different from previous assumptions that the most-studied IMPα–IMPβ pathway imports more cargoes than other importin systems13. Some importins seem to import functionally related cargoes (TABLE 1); for instance, IMPβ imports many cargoes linked to gene expression, KAPβ2 cargoes include many mRNA processing factors and TNPO3 imports many cargoes linked to transcription elongation, mRNA splicing and export9. However, current understanding of the traffic mediated by many importins remains limited and it is difficult to define the functional preferences or significance of individual Kaps.
In studies of nuclear export traffic, a proteomics study of CRM1 identified ~1,000 potential export cargoes. Only ~10% of these proteins were previously reported as CRM1 cargoes. This suggested that CRM1 carries a far larger traffic volume than previously thought, covering broadly functioning cargoes, and that CRM1 may have a role in generally keeping cytoplasmic proteins out of the nucleus11. Additionally, proximity-dependent biotin identification (BioID) found hundreds of proteins that interact with the exportin CAS, which is generally thought to export only IMPα8. However, further analysis showed that many of these proteins are IMPα cargoes that could have been identified due to the close proximity of CAS with IMPα–IMPβ–cargo complexes during the disassembly of import complexes in the nucleus8. More comprehensive traffic analyses for other exportins have not been reported. Proteomics studies also reported new candidate export and import cargoes for the biportins XPO7 and IPO13 (REFS8,10,11). Gene Ontology analysis of candidate XPO7 cargoes indicates broad cellular functions, suggesting that XPO7 may be a more general exporter. Gene Ontology analyses were not reported for candidate CAS and IPO13 cargoes.
Proteomics analysis also revealed that a subset of import cargoes are transported by multiple importins, especially by a group consisting of IPO4, IPO5 and XPO7 (REF.8) and by highly conserved pairs of Kaps: KAPβ2 and KAPβ2b, as well as IPO7 and IPO8 (REF.9). Considerably fewer cargoes seem to be exported by multiple exportins8. An important caveat of proteomics studies is that individual identified cargoes have not been validated, and thus cannot be considered true cargoes. Experiments showing direct and RAN–GTP-sensitive Kap–cargo interactions (for import cargoes) or RAN–GTP-cooperative Kap–cargo interactions (for export cargoes) are necessary to eliminate false positives. Overall, the understanding of Kap traffic flow and overlap are just beginning to emerge, and will be improved by careful use of Kap-specific inhibitors (BOX 1).
Recognition of cargoes by Kaps
Kaps are proteins of molecular weight around 95–140 kDa and are composed of 19–24 HEAT repeats arranged in a superhelical or solenoid structure. The convex and concave surfaces of the superhelical structure as well as hydrophobic grooves between HEAT repeat helices and loops of various length that connect the HEAT repeats are utilized in different ways to recognize cargo, RAN–GTP and FG Nups. Kaps use diverse modes to bind cargoes, binding linear sequence elements (NLSs or NESs) and/or folded domains within the cargoes (FIG. 1b–d). The mechanisms of cargo recognition are known for some Kaps but remain undefined for many other Kaps.
Recognition of linear targeting signals
Protein cargoes commonly use NLSs and NESs to bind their cognate Kaps for import and export, respectively. Only four classes of NLSs and one class of NESs have been defined, each for a specific Kap, and targeting signals for many Kaps remain undefined14,15. Each NLS/NES class has distinct sequence patterns and binds to different sites on their cognate Kap. NLSs/NESs do share some common features: they are often found in intrinsically disordered regions (IDRs) or at termini of cargoes; they become ordered, adopting either extended or helical conformations, upon binding Kaps; many NLSs are enriched in basic residues as Kaps are acidic; and the peptides in an NLS/NES class can be highly diverse in sequence. These common features and the exceptions are discussed below.
NLS and NES classes.
The oldest class of NLS is the highly basic classical NLS (cNLS) that binds IMPα proteins (seven subtypes in humans; see Supplementary Table 1)16, which are adaptors for IMPβ. The two subclasses of cNLS are the monopartite cNLS that is 5–7 residues long (consensus: K-K/R-X-K/R) and the bipartite cNLS with 2 linker-connected basic clusters (consensus: K/R-K/R-X10–12-K/R3/5, where X10–12 is a linker of 10–12 residues and K/R3/5 refers to three basic residues within five consecutive residues)17,18. Recent findings suggest that the linkers can be much longer than the classical ones of 10–12 residues and may contain α-helical elements or even entire folded domains19,20. cNLSs bind two sites, major and minor, on the concave surface of the IMPα armadillo (ARM) repeat domain; most monopartite cNLSs bind to the major site and a few bind to the minor site, whereas bipartite cNLSs occupy both sites21. The residues that form the concave surface of IMPα, including the major and minor cNLS binding sites, are virtually identical between IMPα subtypes16. The amino-terminal IMPβ-binding (IBB) region of IMPα contains an NLS-like49KRRNV53 segment that binds the major cNLS-binding site of unliganded IMPα and is thus autoinhibitory22,23. IMPβ binding to the IBB frees up the IMPα major site to bind cNLSs on cargoes, increasing cargo-binding affinity by >100-fold24–27 (BOX 1; TABLE 1). However, some cNLSs bind to IMPα with such high affinity as to displace the IBB in the absence of IMPβ; examples include cNLSs of several inner nuclear membrane proteins (Supplementary Box 2). The extent of IBB autoinhibition varies for different IMPα subtypes, conferring subtype specificity for some cargoes that can bind IMPα in the absence of IMPβ28,29. The invariant cNLS binding surface shared by all IMPα subtypes suggests that subtype specificity is determined by interactions outside the cNLS-binding sites16,29. Additional examples of IMPα subtype specificity are discussed below (also reviewed elsewhere16).
The second type of NLS defined is the proline-tyrosine NLS (PY-NLS) that binds KAPβ2 or KAPβ2b. PY-NLSs are 15 to >100 residues long, have overall positive charge and bind in mostly extended conformations to a negatively charged site of KAPβ2 (REFS1,30–32) (FIG. 2). PY-NLSs are very diverse in sequence and best defined by a set of physical, chemical and sequence rules. Sequence motifs include an N-terminal hydrophobic or basic motif and a carboxy-terminal R/K/H-X2–5-P-Y or R/K/H-X2–5-P-ϕ motif (ϕ, hydrophobic; H, histidine; P, proline; Y, tyrosine). The N-terminal, R/K/H and PY motifs quasi-independently contribute to overall binding energy and can be used in combination such that two of the three motifs can sometimes confer import activity. Hence, some PY-NLSs do not have a clearly recognizable PY motif30.
The third NLS class is the isoleucine-lysine NLS (IK-NLS) that binds yeast KAP121 (FIG. 2). IK-NLSs have the consensus sequence K-V/I-X-K-X1–2-K/H/R, where the valine (V)/isoleucine (I) and the lysine (K) in the fourth NLS position are Kap121-binding hotspots and Kap–cargo specificity determinants33,34. Little is known about sequences that bind the homologous human IPO5, but KAP121 residues that contact IK-NLSs are conserved in IPO5, suggesting that it likely also binds IK-NLS-containing cargoes33. Both the IK-NLS and monopartite cNLS contain multiple lysine residues and are compact, but with different sequence patterns (K-V/I-X-K-X1–2-K/H/R versus K-K/R-X-K/R). Little is known about whether they bind KAP121/IPO5 and IMPα interchangeably. One study showed that overexpression of IK-NLS of Ubiquitin-like-specific protease 1 (ULP1) inhibited the KAP121 but not the IMPα pathway35, but potential cross-interactions need further investigation.
The fourth class of NLS is the TNPO3-binding arginine-serine repeat NLS (RS-NLS) found in RS domains (>50 residues long, >40% RS dipeptide) of many serine-arginine (SR) splicing factors. The RS-NLS is often immediately C-terminal to an RNA-recognition motif (RRM) domain. Many RS-NLSs must be phosphorylated on serine residues to bind TNPO3 (REFS36–38); the alternating arginine and phospho-serine residues bind complementary acidic and basic patches on TNPO3 (FIG. 2). Some cargoes, such as pre-mRNA cleavage factor CPSF6, have arginine-glutamate/aspartate instead of phospho-RS dipeptides36.
At this time, the only known export signal class is the one that binds CRM1, hence it is simply called the NES. NESs are usually 8–15 residues long and contain 4–5 hydrophobic anchor residues that interact with CRM1, arranged with various spacings14. The first NES sequences discovered had many leucines, but it is now obvious that NESs are very diverse in sequence and structure14,39,40. NESs can bind in both polypeptide directions and occupy different extents of a structurally invariant hydrophobic groove on the convex/outside surface of the ring-shaped CRM1 (REFS14,41–44) (FIG. 3a). NES peptides adopt various helix, β-strand, loop or turn structures that position hydrophobic side chains into hydrophobic pockets in the groove of CRM1. The only conserved structural element of the NES is a single helical turn that forms hydrogen bonds with a conserved CRM1 lysine that acts as a selectivity filter14. An active NES has to meet only this one structural requirement of three or four residues and the rest of it can be structurally diverse, resulting in the large sequence variability of NESs. At this time, there are 11 NES consensus patterns (TABLE 1), but there may be more to be uncovered. The very relaxed sequence requirements of an NES allow CRM1 to recognize thousands of diverse NESs in broadly functioning protein cargoes.
Potential NLSs and NESs.
Other Kap-binding segments within IDRs of cargoes have been identified, but their Kap-binding elements have not been determined and/or they have not been found in multiple different cargoes that are recognized by the same Kap. Therefore, they are yet to be classified as NLSs or NESs.
IMPβ binds directly to linear segments of several cargoes. The IBB in IMPα and other IMPβ adaptors, such as the U small nuclear RNP-specific nuclear import adapter Snurportin 1 (SNUPN)45 and Replication Protein A (RPA) complex import adaptor RPA-interacting protein (RIP)46, is not usually thought of as an NLS but it is much more similar to a linear signal than a folded domain. This ~40-residue region or motif, which is enriched in basic and hydrophobic residues, is intrinsically disordered but becomes a long helix that is wrapped around by the IMPβ superhelix upon IMPβ binding1,47 (FIG. 2). IMPβ also binds directly, without using IMPα, to at least 29 different cargo proteins that share no obvious common elements that bind IMPβ (TABLE 1). Of these, a few use disordered and basic linear cargo segments to bind IMPβ; they are often simply called ‘non-classical NLSs’. In particular, the neuroendocrine peptide parathyroid hormone-related protein (PTHrP) uses a 28-residue IDR segment to bind in an extended conformation to the N-terminal half of IMPβ, in a site that partially overlaps with the IBB-binding site48. PTHrP sequence motifs that are important for IBB have not been mapped and it is unclear whether PTHrP-like peptides exist in other IMPβ cargoes. Further studies of IBB-like and PTHrP-like cargo segments are needed before they can be classified as NLSs.
KAPβ2 also binds at least one other type of linear targeting element. The importin binds arginineglycine-glycine or RGG IDRs in many RNA-binding proteins, including the translational repressor cold-inducible RNA-binding protein (CIRBP) and multifunctional RNA/DNA binding protein fused in sarcoma (FUS) (Supplementary BOX 4). RGG regions bind dynamically to KAPβ2 sites that overlap with the PY-NLS binding site49–51. They bind weaker than PY-NLSs (TABLE 1) and are outcompeted by neighbouring PY-NLSs. For example, wild-type FUS uses its high-affinity PY-NLS to bind Kapβ2, but the amyotrophic lateral sclerosis-associated variant FUS(R495X) that is missing the PY-NLS uses RGG regions to bind KAPβ2 (REF.49). The prevalence of cargoes that use RGGs to bind KAPβ2 is undetermined and it is currently unknown whether there are additional sequence motifs unrelated to RGGs and the PY-NLS that also bind KAPβ2.
The remaining importins (IPO4, IPO5, IPO7, IPO8, IPO9 and IPO11) bind linear sequence elements in some cargoes, but the NLSs have not been defined due to lack of biochemical and structural information about how they bind their importins1. IPO5 is predicted to recognize IK-NLS-containing cargoes, but it is unclear whether it also recognizes other types of NLSs. Recent studies showed several importins (IPO4, IPO5, IPO7 and IPO9 along with IMPα, IMPβ and KAPβ2) binding the basic disordered tails of histones H3 and H4, but it is unclear how important these interactions are for nuclear import52. Slightly more is known for IPO7 as it was shown to mediate stimulated import of a few kinases (SYK, ERK2 and MEK1) and transcription factors SMAD3 and EGR1 upon phosphorylation of their serine-proline-serine motifs. It is not known whether the motifs bind IPO7 directly or whether phosphorylation exposes a separate NLS or an importin-binding surface. Comprehensive biochemical and structural analyses of candidate cargoes obtained from proteomics studies will likely reveal many new NLS classes.
No additional NES class, other than the one for CRM1, has been defined for other exportins. No other linear segments found in multiple cargoes are known to direct CRM1-mediated export. This is true for other exportins and biportins except for the yeast exportin MSN5, which exports many phosphorylated proteins by binding to a still undefined phospho-NES that is located in the IDRs of its cargoes. For example, transcription factor PHO4 must be phosphorylated at two specific sites within a 150-residue-long IDR inter-domain linker53. Sixteen other MSN5 cargoes, all involved in either environmental response or cell cycle regulation, contain IDR segments that drive export only when phosphorylated. No common sequence/structural features have been identified for this potential phospho-NES (Supplementary Table 2). Although the human homologue of MSN5 (XPO5) is known to export RNAs, its one putative protein cargo is the phosphorylated RNA editing enzyme ADAR1, suggesting that the recognition of a phospho-NES may be evolutionarily conserved54,55. Further studies are needed to understand how this phospho-NES binds its exportin.
Altogether, although five NES/NLS classes have been defined, cargoes for many Kaps need to be studied to reveal new NLS and NES classes. Detailed knowledge of the sequence and structural requirements for interactions between Kaps and linear signals within their cargoes will improve existing NLS/NES predictors (BOX 2) and enable development of new predictors.
Recognition of folded domains
Some protein cargoes use folded domains rather than linear NLSs/NESs to bind their Kaps. Although IMPα predominantly binds to linear cNLSs, a few cargoes exclusively use folded domains to bind a distinct non-cNLS-binding site on IMPα, thus conferring IMPα subtype specificity (for example, Ebola virus secondary matrix protein VP24 binding to IMPα6 (REF.56)). Several IMPβ−cargo interactions also involve globular cargo domains. Examples include the cholesterol regulator sterol regulatory element-binding protein 2 (SREBP2) and the transcription factor zinc finger protein SNAI1, which bind IMPβ through their dimeric helix–loop–helix leucine zipper domain and zinc finger domains, respectively. Both IMPβ-binding domains are enriched in basic residues, and the flexible IMPβ superhelix adopts different helical pitches and uses different sites to bind these very differently sized and shaped domains57,58 (FIG. 2). Due to sterically clashing binding sites and/or allosteric conformation changes of IMPβ, the binding of these domains and the IMPα IBB to IMPβ is mutually exclusive, with the IBB binding with higher affinity. It is likely that more cargoes will be found to bind IMPβ through folded globular domains.
Other importins also bind folded domains of their import cargoes. IPO9 was recently shown to bind the highly basic histone H2A–H2B dimer by wrapping around its globular histone-fold domain59. IPO9 was also reported to bind very differently folded cargo regions such as the C-terminal HEAT repeats of serine-threonine-protein phosphatase PP2A and the homeobox domain of developmental regulator Aristaless60,61. Similar to IMPβ, IPO9 probably uses different sites and binding modes to bind structurally diverse cargo domains. IPO4, IPO5, IPO7, IPO8 and IPO11 all bind diverse folded cargo domains but little information is available about these interactions1,11. IPO5 binds to a helical region of the high-density lipoprotein component apolipoprotein A-I, IPO8 binds the globular single-domain eIF4E and IPO11 binds the compact five-helix matrix domain of the Rous sarcoma virus Gag polyprotein1,62.
Biportins IPO13, XPO4 and XPO7 also bind folded domains of their import cargoes. Recent structures showed IPO13 using different sites to bind the diverse domains of two different import cargoes: the exon junction complex MAGO-Y14 (REF.63) and SUMO-conjugating enzyme UBC9 (REF.64) (FIG. 3b). The yeast homologue of IPO13, PDR6, also imports yeast UBC9 but uses a binding mode different from that of IPO13 (REFS12,64,65). IPO13 and PDR6 bind different regions of UBC9 using their N-terminal and C-terminal HEAT repeats, respectively. XPO4 binds the three-helix HMG box domains of import cargoes Sex-determining Region Y protein (SRY) and SRY-box transcription factor 2 (SOX2)66. A recent proteomics study identified structurally diverse candidate import cargoes for XPO7 but it is not yet known whether they bind using linear elements or folded domains11.
Many export protein cargoes use folded domains to bind exportins (other than CRM1 and MSN5) and biportins. The exportin CSE1 (yeast homologue of CAS) wraps around both the ARM domain of autoinhibited KAP60 (yeast IMPα homologue) and RAN–GTP (FIG. 3a). XPO6 exports the profilin–actin complex, which has only folded globular domains and no IDRs; however, XPO6 binding to profilin–actin has not been biochemically demonstrated and it is unclear whether additional factors, potentially with linear motifs, are involved. Notably, the nuclear tumour suppressor protein RASSF1A appears necessary for RAN–GTP interaction with XPO6 and for nuclear export of profilin–actin67. Biportins IPO13, XPO4 and XPO7 also recognize diverse folded domains, such as histone fold heterodimers and helical homeodomains, in their export cargoes (reviewed elsewhere1). IPO13 binds the folded domain of export cargo eukaryotic translation initiation factor 1A (eIF1A) that binds to a site distinct from those for import cargoes68 (FIG. 3b). XPO4 binds the MH2 domain of SMAD3 (REF.69), as well as the SH3-like domain and OB domain of eIF5A (REFS70,71).
In summary, Kaps use many different binding sites to bind diverse folded cargo domains. The flexible superhelical architecture also enables Kaps to adopt various conformations to accommodate diverse cargo domains.
The combined use of motifs
A Kap may also bind to both a linear element and folded domain in a cargo. For example, folded domains immediately adjacent to linear cNLSs (such as in RCC1) require certain IMPα subtypes with more flexible ARM domains such as IMPα3 to accommodate the bulky fold29,72,73. The unusual IMPβ–IPO7 heterodimer binds both the globular histone-fold domain of histone H1 and its C-terminal disordered tail74. TNPO3 binds both the RS-NLS and the RRM domain of splicing factor 1 ASF1 (also known as SF2) (FIG. 2); the RS-NLS is the primary TNPO3-binding determinant and the RRM a secondary element1,36,75,76. Many TNPO3 cargoes have RRM domains but many candidate TNPO3 cargoes identified by proteomics do not75; it is unclear how prevalent TNPO3–RRM interactions are. Lastly, SNUPN binds CRM1 in a multipartite manner using an N-terminal NES, a folded globular m3g cap-binding domain and a disordered C-terminal tail, all binding to the outer convex surface of CRM1 (REFS41,42) (FIG. 3a). It is not known whether other cargoes also use folded domains, with or without an NES, to bind CRM1.
RNA cargoes
The exportins XPO5 and XPOT are known to export RNA rather than protein cargoes, being primary exporters of pre-microRNAs (pre-miRNAs) and tRNAs, respectively. Both XPO5 and LOS1 (Schizosaccharomyces pombe homologue of XPOT) wrap around their double-stranded RNA cargoes, making electrostatic interactions with the negatively charged RNA backbones and the pre-miRNA 3′ overhang or the tRNA 3′ CCA motif (FIG. 3a; reviewed in REF.2). Notably however, these RNA species can be exported by multiple exportins77. For instance, XPO5 also exports tRNAs but likely binds them differently to pre-miRNAs78,79.
Some RNA cargoes do not bind directly to their Kaps but bind via adaptor proteins. SNUPN is an import adaptor that binds m3G-cap-containing spliceosomal U small nuclear RNPs and IMPβ45. Many different RNAs including mRNA80, tRNA77, small nuclear RNA and small nucleolar RNA81, miRNA82,83, telomerase RNA84 and viral RNA85 are exported by CRM1, all by binding to different adaptor proteins, such as nuclear RNA export factor NXF3 (REF.86), RNA helicase DBP5 (REF.87), phosphorylated adaptor for RNA export (PHAX)88 and HIV transactivating protein Rev89, which drive export via their NESs. It is unclear whether the RNA cargoes make direct contact with CRM1 in addition to binding the adaptors.
In general, shape and charge complementarity govern all Kap–cargo (including protein and RNA) interactions. The flexible HEAT repeat architecture allows recognition of diverse signals and/or structural features/shapes of the cargoes. The use of adaptors further diversifies cargo recognition modes and expands Kap cargo repertoires.
Cargo loading and unloading via RAN–GTP
RAN–GTP, but not RAN–GDP, binds all Kaps to regulate cargo binding and allow cargo loading/unloading in both transport directions. All Kaps use their N-terminal portion comprising ~100 residues (HEAT repeats 1–4) to bind the switch 2 feature of RAN–GTP. Kaps also use additional HEAT repeats in the N termini and, sometimes, C termini to bind switch 1 and the basic patch of RAN–GTP33,42,63,65,68,70,75,90–94 (FIG. 1c). Additionally, IMPβ, KAPβ2, TNPO3, KAP121 and CSE1 use long acidic loops between or within HEAT repeats to bind switch 1 or the basic patch of RAN–GTP33,75,90,91,94. RAN–GTP binds tightly to all importins, has high-to-moderate affinity for biportins and shows weak to no binding to most unliganded exportins68,71,95 (TABLE 1). Here, we review the different ways Kaps use their flexible HEAT repeat architecture to achieve mutually exclusive binding of RAN–GTP versus import cargo, or the RAN–GTP-dependent binding of export cargo (FIG. 1d).
RAN–GTP-regulated cargo import
For nuclear import, the binding of RAN–GTP versus cargo is mutually exclusive. This selection is driven by the existence of a steric clash between import cargo and RAN–GTP when binding importin (steric mechanism), by the diverse conformational changes of the importin upon binding RAN–GTP that abolish cargo binding sites (allosteric mechanism) or a combination of the two mechanisms (FIGS 1b–d and and2).2). TNPO3 uses a steric clash mechanism, with overlapping RAN–GTP and RS-NLS binding sites75, whereas IMPβ primarily rearranges HEAT repeat packing into a conformation that is incompatible with cargo binding96,97. Other importins use combinations of steric and allosteric mechanisms. A long acidic loop between the helices of HEAT repeat 8 in KAPβ2, which is disordered in the KAPβ2–cargo complex, undergoes a drastic conformational change when it interacts with RAN–GTP, allowing it to engage the PY-NLS binding site and displace the cargo94. For KAP121, RAN–GTP binds at a site that partially overlaps with the IK-NLS binding site, and also causes the KAP121 superhelix to elongate, closing the IK-NLS binding pockets33.
Although, in most cases, binding of RAN–GTP to the importin is sufficient to release the cargo, there are a few reports of importin–cargo complexes where RAN–GTP alone is not able to mediate this release. The most clearly demonstrated example is the RAN–GTP inability to release histones H2A–H2B from IPO9 (REF.59) or from its yeast homologue KAP114 (REFS98,99). RAN–GTP is also unable to release another KAP114 cargo, the TATA-binding protein100, or the mRNA-binding protein NPL3 from its importin, MTR10 (REF.101). Other examples with conflicting data also exist, such as the release of RNA-binding proteins NAB2 and NAB4/HRP1 from yeast KAP104 (KAPβ2 homologue)31,102; further studies are needed to confirm the RAN–GTP insensitivity of this release. In all of these cases, nucleic acid targets (chromatin, RNA) of the cargoes are required along with RAN–GTP for efficient cargo release, suggesting that locations of cargo release in the nucleus may be specified by downstream functions of the cargoes. The mechanisms of this cargo release mediated by the combined effect of the nuclear target and RAN–GTP are unknown at this time.
RAN–GTP-regulated cargo export
To enable nuclear export, exportins show little or no binding to cargo or RAN–GTP alone, and instead bind both cooperatively. XPO5/MSN5 is an exception as it binds stably to RAN–GTP alone and to some primary miRNA precursors95,103,104, perhaps suggesting additional miRNA processing roles for this exportin104. Coupled cargo and RAN–GTP binding involves various large conformational changes in most exportins and direct contacts between RAN–GTP and cargo that stabilize the ternary complex (FIGS 1b–d and and3a;3a; CRM1 is an exception, see below). Unliganded CSE1 and XPO5 form closed-ring structures that open up when binding both cargo and RAN–GTP to accommodate both molecules inside the exportin solenoids91,93,103,105. Conversely, unliganded CRM1 and LOS1 adopt open conformations, but undergo compaction when binding cargo and RAN–GTP to interact with both molecules favourably; compaction of CRM1 is one of the many conformational changes that leads to the opening of the NES binding groove (reviewed elsewhere106 and more below) whereas the LOS1 superhelix clamps onto both RAN–GTP and cargo41,42,92,107. Additionally, binding of RAN–GTP displaces C-terminal autoinhibitory tails/helices of both CRM1 and XPO5, to facilitate the opening or closing of the exportin superhelix, respectively103,108,109.
As mentioned earlier, CRM1 is the only exportin where the RAN–GTP and cargo binding sites are distant, with no direct RAN–GTP–cargo contacts. This is because CRM1 uses its outer/convex surface to bind cargoes, whereas Ran–GTP interacts with the inner/concave surface. Coupling of RAN–GTP and cargo binding is thus driven by conformational changes to open or close the NES-binding groove in CRM1. The precise mechanisms for CRM1 export cargo dissociation vary in yeast and human even though both involve RanBP1. Yeast RanBP1 binds the inner CRM1 surface in the RAN–GTP–CRM1–cargo complex, which induces a conformational change that closes the NES-binding groove to release cargo110. By contrast, human RanBP1 strips RAN–GTP from the RAN–GTP–CRM1–cargo complex to cause subsequent cargo release111.
Cargo loading/unloading in biportins
Biportins use the same mechanisms as importins and exportins to achieve RAN–GTP selective loading/unloading of cargoes (FIG. 3b). RAN–GTP binds to the cargo (UBC9) binding sites of IPO13 and PDR6 to sterically displace the cargo; RAN–GTP binding also causes a slight compaction of the biportin superhelices64,65. However, RAN binding and slight compaction of IPO13 do not release import cargo MAGO-Y14, which requires export cargo eIF1A for efficient dissociation in vitro63,64, suggesting that import and export of some cargoes by may be coupled. For export cargoes, RAN–GTP binding to IPO13, PDR6 and XPO4 causes their superhelices to twist and tighten to achieve more contacts with export cargoes, and RAN–GTP also contacts cargoes to stabilize the export complexes65,68,70.
In summary, Kaps adapt to RAN–GTP binding quite differently to enable either nuclear import or export. Kap interactions with RAN–GTP either abolish binding sites for the import cargo to release cargo in the nucleus, or they help establish favourable binding sites for export cargo to increase its binding in the nucleus. RAN–GTP itself may also inhibit the binding of import cargo through steric hindrance or may facilitate the binding of export cargo by contacting both cargo and Kap.
Regulation of Kap-mediated transport
The nuclear versus cytoplasmic distributions of many proteins remain steady over the long term, but the distributions of many other proteins are dynamic, changing according to the progression of cell cycle, according to environmental cues or to generate specific cellular responses (FIG. 4a). This can be achieved by modifications of the Kap system via several mechanisms involving the cargo itself and/or the Kaps, which impact the transport rates of some or all cargoes in a particular direction.
Changes to cargoes
Studies of the IMPα–IMPβ and CRM1 systems show that higher cNLS/NES binding affinities correlate with greater nuclear or cytoplasmic localization within an optimal range, respectively27,112 (TABLE 1). Post-translational modifications (PTMs) in or adjacent to signals in the cargo that bind to Kaps can change Kap–cargo affinities and thus cause a change in their cellular localization (FIG. 4b). The effects of PTMs are usually highly dependent on their position within the full-length cargo proteins. This topic was previously reviewed113,114 and we highlight recent examples here. PTMs of NLSs/NESs, such as phosphorylation, acetylation or methylation, are known to affect Kap affinities. In some instances, the effects are predictable; for example, phosphorylation of the MSN5-binding cargo segments and the TNPO3-binding RS-NLS is critical for Kap binding, whereas arginine methylation of the FUS RGG regions impairs FUS interaction with KAPβ2 (REF.115). In other cases, the effects are difficult to predict; for example, phosphorylation of cNLSs inhibits import of some cargoes but enhances import of others. Recent examples of cNLS phosphorylation include serine phosphorylation of the bipartite cNLS linkers of tumour suppressor protein 53BP1 (REF.116) and peptide hormone GLP1 (REF.117) that decreased importin binding and nuclear import. Acetylation of lysines in the cNLSs of the glucose metabolic enzyme PFKFB3 (REF.118) and the antiviral protein IFIX119 also inhibited import, but lysine acetylation of the tyrosyl-tRNA synthetase (TyrRS) cNLS promoted import; in the latter case, the unacetylated cNLS is helical and acetylation is suspected to destabilize the helix to generate an extended and active cNLS120. Effects of NES modifications are similarly difficult to predict. Phosphorylation near the NES of transcription factor TFEB121 increased CRM1 binding but phosphorylation of the hypoxia-inducible factor HIF2α NES is inhibitory122. Methylation of a lysine adjacent to the NES of the transcriptional regulator YAP1 also inhibited CRM1 binding123.
Modifications that change the accessibility of NLSs/NESs to Kaps can also affect Kap–cargo binding (FIG. 4b). Conformational changes of the cargo or changes in cargo oligomerization may expose or sequester targeting signals. Both types of changes may be induced by PTMs or by binding to additional macromolecule partners. Classic examples include tetramerization of the tumour suppressor protein p53 that masks its NES, and the masking of the NLS in the transcription factor NF-κB/RELA by its binding to a regulatory partner protein IκB (reviewed in REF.124). More recently, dephosphorylation of serine residues distant from the putative NLS of transcription factor YB1 was suggested to increase NLS accessibility and nuclear entry during the late G2/M phase of the cell cycle, mediated by a conformational change125. By contrast, phosphorylation of the apoptotic regulator caspase 2 causes binding of scaffolding protein 14-3-3 that masks the caspase 2 cNLS, inhibiting its nuclear import and activation126,127. Similar control of NES accessibility has also been observed. Acetylation of the N-terminal IDR region in the fly homeobox protein Ubx, which is far from its NES, seems to mask the NES via an unknown mechanism, to inhibit export128.
Mutations of NLSs and NESs, such as those that occur in neurodegenerative diseases and cancers, change cargo localization by disrupting functional targeting signals or by generating new ones. Many mutations in and/or adjacent to the PY-NLSs of mRNA binding proteins FUS and heterogeneous nuclear RNPs (hnRNPs) A1 and A2B1 were found in patients with familial amyotrophic lateral sclerosis and multisystem proteinopathy129–132. Mutations in the PY-NLSs of hnRNPs H1 and H2 are also thought to drive the mental retardation, X-linked, syndromic, Bain type (MRXSB) disease133,134. All these mutations likely decrease KAPβ2 binding and disrupt nuclear import. Conversely, frameshift mutations in nucleophosmin create potent NESs that lead to aberrant CRM1-mediated nuclear export in a subtype of acute myeloid leukaemia135. The W1038X truncation mutation in tumour suppressor PALB2 also exposes an otherwise occluded NES, mislocalizing the protein in breast/ovarian cancers136. Alternatively, amyotrophic lateral sclerosis-linked mutations in superoxide dismutase 1 (SOD1) cause SOD1 misfolding that exposes an NES for CRM1 export137.
In summary, modifications to cargoes, both within and outside Kap-binding regions, can change Kap–cargo binding affinities in many different ways. These changes to cargoes ultimately result in their altered subcellular localization.
Changes to Kaps
Multiple examples of PTMs of Kaps have been reported. These modifications can change Kap–cargo affinities, either by direct chemical changes to cargo binding sites or, indirectly, by allosterically changing the conformations of cargo binding sites (FIG. 4c). An example of the former includes S-nitrosylation of cysteines at the NES-binding groove of CRM1 that inhibits NES binding138. Examples of allosteric mechanisms include phosphorylation of the C-terminal autoinhibitory tail of CRM1 that activates export139, and XPO5 phosphorylation by kinase ERK that enables binding to the peptidyl-prolyl cis/trans isomerase PIN1, which stabilizes a XPO5 conformation that disfavours pre-miRNA binding140. More unusual examples of Kap modifications include disulfide linkage of KAPβ2 to the transcription factor FOXO4 in response to reactive oxygen species, which leads to FOXO4 nuclear entry141.
PTMs of Kaps can also modulate interactions with other proteins to affect nuclear transport. sumoylation of KAP114 stimulates import by facilitating RAN-induced cargo release, but the mechanism is still unclear142. Phosphorylation of IPO13 by kinase PKA upon hormonal stimulation reduced nuclear entry of the Kap and increased binding of cargo transcription factor PAX6 but reduced its nuclear import, suggesting that factors other than Kap–cargo affinity decreased PAX6 import143. PTMs of Kaps also lead to changes in their cellular concentrations and thus change their global import/export profiles. CRM1 phosphorylation by kinase AKT3 induces CRM1 degradation, causing nuclear accumulation of cargo transcriptional coactivator PGC1α, which is critical for regulating mitochondrial biogenesis and driving prostate cancer metastasis144,145.
Many PTMs and mutations of Kaps as well as changes in their cellular concentrations are associated with diseases. CRM1 has emerged as a viable therapeutic target146,147 as it is overexpressed in many cancers, and mutations in its NES-binding groove are associated with tumorigenesis and with poor prognosis in blood cancers148–151. The oncogenic, charge-reversal E571K mutation in CRM1 was reported to affect binding of a subset of NESs/cargoes152–154. A SILAC-based mass spectrometry study identified some cargoes with altered cellular localization in B cells carrying the heterozygous E571K CRM1 mutation153. A separate study showed that export cargo P-body protein eIF4E transporter was mislocalized to the nucleus in cultured cells and in leukaemia cells carrying the E571K mutation in CRM1. This was accompanied by decreased binding of mutated CRM1 to eIF4E transporter, suggesting that E571K mutation impairs cargo–CRM1 interaction, resulting in defective export154. However, a recent study showed that this mutation does not affect CRM1-mediated export, but that mutated CRM1 is instead enriched at the nuclear envelope via an unknown IMPβ-dependent mechanism with an unknown effect155. It remains unclear how CRM1 mutations drive tumorigenesis. XPO5 overexpression, mutations and modifications are also found in various cancers156. Phosphorylation and sumoylation of XPO5 suppress pre-miRNA export, which leads to aberrant expression of oncogenic proteins140,157.
Similar to cargo modifications, Kap modifications affect cellular localization of their cargoes via many different mechanisms. However, the modification of a Kap likely has a larger effect as it affects at least a subset if not all of its cargoes. Diverse changes to cargoes and Kaps thus allow complex regulation of the nucleocytoplasmic transport and the subcellular distribution of cargoes that is integrated into a wider cell biology, allowing appropriate responses to various cues. However, it is also apparent that aberrant changes to Kaps and/or their cargo can perturb the physiological distribution of cargo, which may contribute to disease.
Kaps in biogenesis of large assemblies
Many large (hundreds to thousands of kilodaltons) macromolecular machineries/assemblies are either assembled in the nucleus and/or function in the nucleus, raising questions of whether they are assembled before or after nuclear import and how assembly is coordinated with import/export. We review these questions for three assemblies that each function in major gene expression processes: replication, transcription and translation.
Import of the replisome core
Assembly of the multiprotein DNA replication machinery, or replisome, on DNA has been reviewed extensively, but its trafficking to the nucleus is rarely discussed. Components of the eukaryotic replisome core include the CMG helicase (named for its components — cell division control protein 45 (CDC45), minichromosome maintenance complex (MCM2–MCM7) and the GINS complex (SLD5–PSF1–PSF2–PSF3)), three DNA polymerases (Polε, Polδ and Polα-primase), processivity clamp proliferating cell nuclear antigen (PCNA) and clamp loader replication factor C (RFC), clamp unloader RFC-like complexes (RLC) and the single-stranded DNA-binding protein RPA (Fig. 5a). The components of the CMG helicase seem to be imported separately prior to their assembly in the nucleus. Yeast IMPα–IMPβ is responsible for importing the bipartite cNLS-containing CDC45 (REFS158,159) and the MCM2–MCM7 complex (where MCM2 and MCM3 have cNLSs)160–162. The latter is assembled in the cytoplasm by loading factor CDT1, which is imported with the MCM complex to load it onto DNA163,164. It is not known how the GINS complex is trafficked. Mechanisms of trafficking for the remaining replisome components are largely unclear, with some conflicting reports. The p66 and p125 subunits of human Polδ have putative cNLSs165,166. However, the C-terminal α-helical putative cNLS of p125 is atypical and it is unclear whether it is functional for import167. No information is found on nuclear import of Polε. The mouse Polα-primase is imported as a heterotetrameric complex; putative cNLSs in its p180 and p54 subunits direct import of smaller assemblies but are dispensable for import of the larger complex168,169. A bipartite cNLS in p180 is buried in a folded domain, unlikely to drive import170,171. The homotrimeric PCNA binds IMPβ but not IMPα, and the IMPβ-binding region (conserved from human to yeast) is partially located at the trimer interface172,173, suggesting that PCNA is trafficked as a monomer either by binding directly to an importin or by binding the cNLS-containing cyclin-dependent kinase inhibitor p21 (REF.174). The large subunit of the penta meric human RFC clamp loader, replication factor C subunit 1 (RFC1), and that of an RLC clamp unloader, ATPase family AAA domain-containing protein 5 (ATAD5), have putative N-terminal NLSs that have not been verified experimentally175,176. Nuclear import of RPA requires its C-terminal tetramerization domain, but there are conflicting reports of different importers including KAP95 and MSN5 in yeast, and the involvement of RIP as an IBB-containing IMPβ adaptor for RPA in the frog46,177–179.
Some replisome components are found to also exit the nucleus. The MCM complex is exported after replication in subsequent S, G2 and M phases — probably by CRM1 (the NES was identified in yeast MCM3) — which is regulated by cyclin-dependent kinases162,180. Human p125 is also exported by CRM1 but no NES has been reported166. CRM1 exports human PCNA by binding an NES accessible only in the monomer181, consistent with export of monomeric PCNA and its assembly and disassembly on DNA.
In summary, a few replisome components have been found to carry cNLSs and some others may ‘piggy-back’ on cNLS-containing proteins, such as the PCNA binding to p21, for import. The use of cNLSs for the import of the replisome components is accompanied by the involvement of KAP60 in regulating G1/S and G2/M transitions in yeast, suggesting that key DNA replication and cell cycle checkpoint proteins are imported by IMPα–IMPβ to coordinate cell cycle progression with genome replication182,183. However, import mechanisms for the majority of replisome components remain unknown. Similarly, CRM1-mediated export of three replisome components was reported, but it is unclear whether more components are actively exported out of the nucleus and how this export impacts the activity of the replisome.
Import of RNA polymerase II
We focus on RNA polymerase II (RNA Pol II) to describe nuclear import of a large transcription machinery (FIG. 5b). The 12-subunit RNA Pol II is fully assembled in the cytoplasm before import (reviewed in REF.184). Yeast RNA Pol II nuclear localization protein IWR1 contains a cNLS conserved from yeast to human, which binds IMPα–IMPβ, and is believed to be the major import adaptor for assembled RNA Pol II185; the effect of the human homologue of IWR1 has not been verified. Subunit sub-assemblies and small individual subunits can also enter the nucleus in yeast, prior to assembly, by active transport and passive diffusion, respectively186. Regulator of transcription 1 (RTR1; RPAP2 in human) may be the import factor that mediates the entry of individual large subunits of the RNA Pol II and subunit assemblies186,187; however, no NLS has been identified for RTR1. RPAP2 is also implicated in the import of human RNA Pol II187. Glycine-proline-asparagine (GPN)-loop GTPases were also reported as putative import factors in both human and yeast (reviewed in REF.184). Pull-down mass spectrometry studies revealed possible interactions of human GPN1/2/3 with IMPα2 (although no cNLSs were identified)188, but no such interaction was shown for yeast NPA3 (GPN1 homologue)189. However, some studies suggested that GPNs act upstream of IWR1 as cytosolic chaperones for RNA Pol II assembly, rather than as import adaptors190,191.
Several RNA Pol II import adaptors are exported from the nucleus via CRM1, but export of the polymerase has not been reported. The RNA Pol II machinery may be passively retained in the nucleus due to its large size and association with DNA, consistent with studies showing that the nuclear proteome is primarily maintained by complex assembly and passive retention rather than active export6. IWR1 is displaced from RNA Pol II by transcription initiation factors and then exported via a putative NES192. RTR1/RPAP2 is exported in a CRM1-dependent manner (but no NES has been identified)187,193, possibly through binding NES-containing GPNs (the NES was so far identified in GPN1/NPA3 (REF.194); reviewed elsewhere184).
In summary, generally, RNA Pol II complex assembles in the cytoplasm and uses import adaptors for nuclear import which are recycled back to the cytoplasm by CRM1. However, other mechanisms for RNA Pol II nuclear import, involving individual subunits and their smaller assemblies, were also reported. The mechanism of function of these import adaptors and their evolutionary conservation remain mostly unclear. The contribution of the multiple import mechanisms to RNA Pol II biogenesis and function also remains to be addressed.
Import/export of ribosome components
Ribosomes are complex machineries composed of ~80 individual ribosomal proteins (RPs) together with ribosomal RNAs. The assembly of pre-ribosomal particles occurs in the nucleoplasm, where ribosomal RNAs are transcribed, and hence necessitates import of the RPs into the nucleus. Despite the importance of the process, nuclear import mechanisms are so far known for only a few RPs (FIG. 5c). Early studies suggested that some RPs (eS1, eS7, eS26, eL6, eL20, uL4, uL14, uL18 and uL23) could be imported by multiple importins. Yeast cells primarily use KAP123 for RP import; the roles of other Kaps are much less pronounced, and they mostly function as backup pathways to reduce competition for import by KAP123 or when KAP123 is absent195–201. Interestingly, yeast cells show a particularly large import flux for RPs, and KAP123 is correspondingly the most abundant yeast Kap197, underscoring the observation that cellular concentrations of Kaps are linked with their respective cargo loads202. Although the role of KAP123 in RP import is dominant, not all RPs use this route for import. For example, uL11 is exclusively imported by IPO11 (REF.203). Furthermore, cNLS-containing uS3 and PY-NLS-containing uL4 are imported by IMPα–IMPβ and KAPβ2, respectively; this transport is orchestrated by uS3 and uL4 while bound to their cognate chaperones, yeast ankyrin repeat-containing protein 1 (YAR1)204,205 and assembly chaperone of RPL4 (ACL4)206,207, which may prevent these RPs from aggregation and/or ensure proper ribosome assembly. Transport of uL5 and uL18 in yeast is also supported by a chaperone, Symportin 1 (SYO1), which binds both RPs as well as KAP104 via its N-terminal basic PY-NLS for import208. RAN–GTP is presumed to release most imported RPs in the nucleus (see above discussion in Cargo loading and unloading via RAN–GTP). Nevertheless, several exceptions were reported. For example, RAN–GTP is inefficient in releasing uL23 from IPO7, which suggests the involvement of other factors196. Release of RPs eS26 and uL14 from KAP123 and KAP121/KAP123 also occurs in a RAN–GTP-insensitive manner and requires the action of dedicated chaperones/escortins 20S ribosomal RNA accumulation protein 2 (TSR2) (for eS26)200,209 and Bacterioferritin co-migratory protein 1 (BCP1) (for uL14)201. It is unclear what prevents these escortins from prematurely dissociating cytoplasmic importin–RP complexes.
Following import, RPs are assembled and matured into the pre-60S and pre-40S ribosomal subunits before nuclear export for further cytoplasmic processing and the assembly of the mature ribosome. This export occurs via two routes, involving CRM1 and mRNA export receptor Mex67-MTr2 (reviewed elsewhere210). The pre-60S subunit is exported by CRM1 via NES-containing adaptor protein nonsense-mediated mRNA decay protein 3 (NMD3)211, whereas ribosome biogenesis factors low-temperature viability protein 1 (LTV1), partner of NOB1 (PNO1) and RIO (right open reading frame)-type kinase 2 (RIO2) were proposed to function as redundant export adaptors for pre-40S export212–214.
Overall, similar to the replisome, import mechanisms are known for only a small fraction of RPs. Unlike replisome components and RNA Pol II, which seem to use almost exclusively the IMPα–IMPβ pathway, RPs seem to use various importins. Because eukaryotic cells require hundreds of thousands to millions of ribosomes215,216, the availability of multiple import pathways for RPs may reduce the import burden for individual Kaps, as well as provide multiple backup pathways, to ensure efficient delivery of RPs into the nucleus. Assembled ribosomal subunits also use different pathways and adaptors for nuclear exit.
Conclusions and perspectives
Importins, exportins and biportins all share a common spiral-shaped or ring-shaped architecture that is conformationally adaptable with large surface areas to bind many different partners (FIGS 1b–d and and4).4). Unusual surface chemistries of Kaps include hydrophobic residues as well as arginines, histidines and cysteines that confer solubility, allowing them to permeate the FG-repeat populated NPC barrier (Supplementary Boxes 1 and 3) and act as anti-aggregation chaperones for their cargo (Supplementary Box 4). Kaps mediate the majority of nucleocytoplasmic protein traffic with the import load relatively evenly divided among the ten importins and biportins; it is still unclear how traffic out of the nucleus is parsed. Kaps bind to NLS/NES linear motifs and/or folded domain(s) of cargoes (TABLE 1), and the interactions are regulated in many different ways to control import versus export versus nuclear/cytoplasmic retention to, ultimately, control localization and function of a plethora of macromolecules (FIG. 4). Finally, the biogenesis of many gene expression machineries is coordinated with the nucleocytoplasmic traffic of their components, but the current knowledge of the Kaps and signals responsible for these trafficking events is far from complete (FIG. 5).
Our knowledge of how macromolecules are trafficked between the nucleus and the cytoplasm has advanced tremendously since discovery of the Kap family, the RAN GTPase and its regulators in the 1990s. For some Kaps, namely IMPα–IMPβ, KAPβ2, TNPO3 and CRM1, we know many of the cargoes and how these Kaps recognize these cargoes, but much of the knowledge for the other Kaps is missing. More generally, it is still unclear why nucleocytoplasmic traffic is mediated by 20 different Kaps. Is this diversity in Kaps related to different activities of individual Kaps in regulating cell function? Or, perhaps, could the presence of multiple Kaps be linked to the robustness of nucleocytoplasmic transport? What are the mechanistic differences between importins, exportins and biportins, and how is the directionality of transport orchestrated by individual Kap properties? Will we learn in the future that importins/exportins are all biportins?
An emerging theme in Kap–cargo recognition, observed in the IMPα–IMPβ, KAPβ2 and CRM1 systems, involves Kap-binding elements that are very diverse in both sequence and structure, often used in multipartite and combinatorial manners. Such promiscuity in cargo recognition allows each Kap to function as a transport receptor for hundreds to thousands of different protein cargoes. Cargo selection by Kaps extends beyond very diverse NLS/NES sequences to the binding of diverse folded domains. It is unclear whether any common patterns exist for interactions of Kaps with folded domains of cargoes. Specifically, it remains to be addressed whether Kaps are capable of recognizing particular domain types or whether the recognition of folded domains is specific to individual domains, and therefore not general. Missing knowledge of how Kaps bind cargoes contributes to the current inability to robustly predict cargoes for the different Kaps. NLS/NES/cargo prediction for even the best studied IMPα–IMPβ and CRM1 systems is still poor (BOX 2). Expanding the knowledge of how Kaps bind cargoes will also lead to design/development of Kap-specific inhibitors that currently do not exist for most Kaps. Nuclear import and export inhibitors are not just critical as laboratory reagents to control nucleocytoplasmic localization of macromolecules of interest but also as therapeutic agents for the many diseases (cancer, neurodegeneration, inflammation and viral infections) that arise from aberrant nucleocytoplasmic trafficking (BOX 1). Lastly, it is clear that Kap–cargo interactions are regulated to control protein localization that affect many cellular processes such as cell cycle regulation and environmental sensing. Are the regulatory mechanisms used for nucleocytoplasmic transport also used in non-transport functions of Kaps, such as during mitosis? Some examples in spindle assembly and cytokinesis seem to suggest so217–220, but more studies are needed to focus on Kaps in non-transport roles.
Acknowledgements
This work was funded by the National Institute of General Medical Sciences (NIGMS) of the National Institutes of Health (NIH) under Awards R01GM069909 and R35GM144137 (Y.M.C.), the Welch Foundation Grant I-1532 (Y.M.C.), Cancer Prevention Research Institute of Texas (CPRIT) Grant RP180410 (Y.M.C.), support from the Alfred and Mabel Gilman Chair in Molecular Pharmacology, Eugene McDermott Scholar in Biomedical Research (Y.M.C.), the Gilman Special Opportunities Award (H.Y.J.F.) and NIGMS Molecular Biophysics Training Program T32GM131963 (C.E.W.).
RelATed liNKs
cNLs Mapper: http://nls-mapper.iab.keio.ac.jp
iNsP: http://www.csbio.sjtu.edu.cn/bioinf/INSP/
LocNes: http://prodata.swmed.edu/LocNES/LocNES.php
Nesmapper: https://sourceforge.net/projects/nesmapper/
NetNes: https://services.healthtech.dtu.dk/service.php?NetNES-1.1
NLstradamus: http://www.moseslab.csb.utoronto.ca/NLStradamus/
NoLogo: https://github.com/mppl1/NoLogo
NucPred: https://nucpred.bioinfo.se/nucpred/
Nucimport: http://bioinf.scmb.uq.edu.au:8080/NucImport/
PsORT: https://www.genscript.com/psort.html
PsORT ii: https://psort.hgc.jp/form2.html
PredictNLs: https://rostlab.org/owiki/index.php/PredictNLS
seqNLs: http://mleg.cse.sc.edu/seqNLS/
Footnotes
Competing interests
Y.M.C. is a consultant for Faze Medicines. The remaining authors declare no competing interests.
Supplementary information
The online version contains supplementary material available at https://doi.org/10.1038/s41580-021-00446-7.
References
Full text links
Read article at publisher's site: https://doi.org/10.1038/s41580-021-00446-7
Read article for free, from open access legal sources, via Unpaywall: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10101760
Citations & impact
Impact metrics
Citations of article over time
Alternative metrics
Discover the attention surrounding your research
https://www.altmetric.com/details/121318839
Smart citations by scite.ai
Explore citation contexts and check if this article has been
supported or disputed.
https://scite.ai/reports/10.1038/s41580-021-00446-7
Article citations
Targeted protein relocalization via protein transport coupling.
Nature, 633(8031):941-951, 18 Sep 2024
Cited by: 1 article | PMID: 39294374
Interdependence between Nuclear Pore Gatekeepers and Genome Caretakers: Cues from Genome Instability Syndromes.
Int J Mol Sci, 25(17):9387, 29 Aug 2024
Cited by: 0 articles | PMID: 39273335 | PMCID: PMC11394955
Review Free full text in Europe PMC
A new type II CHH neuropeptide involves ovarian development in the peppermint shrimp, Lysmata vittata.
PLoS One, 19(8):e0305127, 01 Aug 2024
Cited by: 0 articles | PMID: 39088423 | PMCID: PMC11293640
Importin-7-dependent nuclear translocation of the Flavivirus core protein is required for infectious virus production.
PLoS Pathog, 20(8):e1012409, 15 Aug 2024
Cited by: 0 articles | PMID: 39146232 | PMCID: PMC11326614
Superoxide signalling and antioxidant processing in the plant nucleus.
J Exp Bot, 75(15):4599-4610, 01 Aug 2024
Cited by: 1 article | PMID: 38460122 | PMCID: PMC11317529
Review Free full text in Europe PMC
Go to all (78) article citations
Data
Data behind the article
This data has been text mined from the article, or deposited into data resources.
Protein structures in PDBe (Showing 25 of 25)
-
(1 citation)
PDBe - 2X1GView structure
-
(1 citation)
PDBe - 3ZJYView structure
-
(1 citation)
PDBe - 3A6PView structure
-
(1 citation)
PDBe - 3W3XView structure
-
(1 citation)
PDBe - 3IBVView structure
-
(1 citation)
PDBe - 2QMRView structure
-
(1 citation)
PDBe - 3W3ZView structure
-
(1 citation)
PDBe - 2X19View structure
-
(1 citation)
PDBe - 4FGVView structure
-
(1 citation)
PDBe - 3GJXView structure
-
(1 citation)
PDBe - 2XWUView structure
-
(1 citation)
PDBe - 1QBKView structure
-
(1 citation)
PDBe - 5YU7View structure
-
(1 citation)
PDBe - 3W5KView structure
-
(1 citation)
PDBe - 4C0QView structure
-
(1 citation)
PDBe - 1Z3HView structure
-
(1 citation)
PDBe - 3ZKVView structure
-
(1 citation)
PDBe - 2BKUView structure
-
(1 citation)
PDBe - 3ND2View structure
-
(1 citation)
PDBe - 1UKLView structure
-
(1 citation)
PDBe - 1QGKView structure
-
(1 citation)
PDBe - 4C0OView structure
-
(1 citation)
PDBe - 4C0PView structure
-
(1 citation)
PDBe - 2H4MView structure
-
(1 citation)
PDBe - 3ICQView structure
Show less
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
Dissecting in vivo steady-state dynamics of karyopherin-dependent nuclear transport.
Mol Biol Cell, 27(1):167-176, 04 Nov 2015
Cited by: 6 articles | PMID: 26538027 | PMCID: PMC4694755
Karyopherin beta 2B participates in mRNA export from the nucleus.
Proc Natl Acad Sci U S A, 99(22):14195-14199, 16 Oct 2002
Cited by: 23 articles | PMID: 12384575 | PMCID: PMC137860
How to operate a nuclear pore complex by Kap-centric control.
Nucleus, 6(5):366-372, 01 Jan 2015
Cited by: 24 articles | PMID: 26338152 | PMCID: PMC4915502
Review Free full text in Europe PMC
[Organization and regulation of nucleocytoplasmic transport].
Mol Biol (Mosk), 44(2):211-228, 01 Mar 2010
Cited by: 1 article | PMID: 20586181
Funding
Funders who supported this work.
NIGMS NIH HHS (3)
Grant ID: R35 GM141461
Grant ID: T32 GM131963
Grant ID: R01 GM069909