Evolutionary conservation of the U7 small nuclear ribonucleoprotein in Drosophila melanogaster
Abstract
The U7 snRNP involved in histone RNA 3′ end processing is related to but biochemically distinct from spliceosomal snRNPs. In vertebrates, the Sm core structure assembling around the noncanonical Sm-binding sequence of U7 snRNA contains only five of the seven standard Sm proteins. The missing Sm D1 and D2 subunits are replaced by U7-specific Sm-like proteins Lsm10 and Lsm11, at least the latter of which is important for histone RNA processing. So far, it was unknown if this special U7 snRNP composition is conserved in invertebrates. Here we describe several putative invertebrate Lsm10 and Lsm11 orthologs that display low but clear sequence similarity to their vertebrate counterparts. Immunoprecipitation studies in Drosophila S2 cells indicate that the Drosophila Lsm10 and Lsm11 orthologs (dLsm10 and dLsm11) associate with each other and with Sm B, but not with Sm D1 and D2. Moreover, dLsm11 associates with the recently characterized Drosophila U7 snRNA and, indirectly, with histone H3 pre-mRNA. Furthermore, dLsm10 and dLsm11 can assemble into U7 snRNPs in mammalian cells. These experiments demonstrate a strong evolutionary conservation of the unique U7 snRNP composition, despite a high degree of primary sequence divergence of its constituents. Therefore, Drosophila appears to be a suitable system for further genetic studies of the cell biology of U7 snRNPs.
Keywords
INTRODUCTION
The formation of nonpolyadenylated histone mRNA 3′ ends in metazoans is the only RNA processing reaction known to be devoted entirely to the expression of a specific protein family (for review, see Müller and Schümperli 1997; Dominski and Marzluff 1999; Marzluff and Duronio 2002). This special RNA maturation pathway is intimately linked to the cell cycle regulation of histone gene expression (Harris et al. 1991). However, plant and fungal histone mRNAs, which are also cell cycle–regulated, are polyadenylated and appear to be generated by the same RNA processing pathway used by other protein-coding genes of these organisms.
Although it shares the need for an Sm-type snRNP with pre-mRNA splicing and certain biochemical features with the 3′ end cleavage/polyadenylation reaction (for discussion, see Müller and Schümperli 1997), histone RNA processing is biochemically distinct from both of these RNA maturation pathways. Several trans-acting factors for histone RNA processing have been identified and studied, primarily in vertebrate systems.
-
The minor U7 snRNP binds to the purine-rich histone downstream element (HDE; Schaufele et al. 1986; Bond et al. 1991) and thereby serves as a molecular ruler, positioning the yet unknown cleavage activity near the processing site (Scharl and Steitz 1994,Scharl and Steitz 1996). U7 snRNA is 58–71 nucleotides long, depending on the species, and like spliceosomal snRNAs, it bears a trimethyl guanosine cap at its 5′ end. Following the 5′ region complementary to the HDE, U7 snRNA contains a noncanonical Sm-binding site (AAUUUGUCUAG in mammals), followed by a long hairpin structure and a very short single-stranded 3′ extension.
-
The histone hairpin binding protein (HBP; Vasserot et al. 1989; Martin et al. 1997), also known as stem–loop binding protein (SLBP; Wang et al. 1996) binds to a highly conserved RNA hairpin just upstream of the cleavage site. This interaction contributes to efficient histone RNA processing by favoring the formation of the histone pre-mRNA/U7 snRNP complex (Streit et al. 1993; Spycher et al. 1994; Dominski et al. 1999).
-
This effect of HBP on processing appears to be mediated by a 100-kD zinc finger protein, ZFP100, that binds to the HBP/hairpin complex (Dominski et al. 2002a) and also interacts with the N terminus of the U7 snRNP-specific protein Lsm11 (Pillai et al. 2003; see below).
However, the details of the interactions between these known processing factors still require further investigation, and some processing factors may still await their discovery.
Our recent studies of the protein composition of vertebrate U7 snRNPs have led to the understanding that a special type of Sm core structure assembles around the Sm-binding site of U7 snRNA. Whereas the Sm core of spliceosomal snRNPs contains the seven Sm proteins B/B′, D1, D2, D3, E, F, and G (for review, see Kambach et al. 1999), the U7 Sm core has Lsm10 and Lsm11 replacing Sm D1 and D2 (Pillai et al. 2001, 2003). This unique Sm core composition has two important implications: First, Lsm11, through its long N-terminal extension, participates in histone RNA processing, and it may do so partly through its interaction with ZFP100 (Pillai et al. 2003; see above). Second, the cell must be able to discern between snRNAs with only slightly different Sm-binding sites, and form the required type of Sm core—spliceosomal or U7-specific—without making mistakes. It apparently does so by preforming specific assembly complexes containing the survival of motor neurons protein (SMN), several Gemin proteins, and either Sm D1/D2 or Lsm10/11, in combination with the other Sm proteins (Pillai et al. 2003). After assembly, the U7 snRNP is imported into the nucleus, where it is highly enriched in Cajal bodies (Wu and Gall 1993; Frey and Matera 1995). Thus, the U7 snRNP not only is the key component and nucleation center of the histone RNA processing complex but also represents an important model for studying the processes of snRNP assembly, nuclear import, and Cajal body formation.
Both HBP and the hairpin are highly conserved and can be identified by sequence comparison in metazoans as well as in certain lower organisms such as Chlamydomonas and Volvox (Wittop Koning and Schümperli 1994; http://www.izb.unibe.ch/res/schuehome/schuemperli/hbppep.html). Moreover, HBP orthologs have been characterized biochemically in mammals, Xenopus, Drosophila, and Caenorhabditis elegans. In contrast, the HDE is conserved to some extent between vertebrates and echinoderms, but in lower invertebrates, the sequences downstream of histone mRNA 3′ ends are divergent, and an HDE is therefore difficult to identify (for review, see Wittop Koning and Schümperli 1994; Dominski and Marzluff 1999). Thus, until recently, HDEs or U7 snRNAs had not been defined in invertebrates other than sea urchins, and orthologs of Lsm10 or ZFP100 could not be identified by sequence similarity searches (Pillai et al. 2001; Dominski et al. 2002a). It was not clear if the inability to find these features was due to poor evolutionary conservation of primary sequences, or if histone RNA 3′ end processing in lower invertebrates operated by a mechanism independent of a U7 snRNP. However, by using an in vitro processing system from Drosophila tissue culture cells, Dominski et al. (2002b), characterized certain mutations downstream of a Drosophila histone RNA processing site that appeared to define a Drosophila HDE, and they also succeeded in characterizing a Drosophila U7 snRNA of 71 nucleotides (Dominski et al. 2003).
Here, we have taken a different approach to characterize the Drosophila U7 snRNP. The recently obtained sequence for the large Lsm11 protein (Pillai et al. 2003) has allowed us to detect potential invertebrate orthologs. Moreover, thanks to the recently published mosquito genome sequence and by using recursive sequence similarity searches, we have succeeded in detecting potential invertebrate Lsm10 orthologs as well. We have studied the interactions of the putative Drosophila Lsm10 and Lsm11 proteins (termed dLsm10 and dLsm11, respectively) with other Sm proteins, as well as with U7 snRNA in Drosophila tissue culture cells. Moreover, we demonstrate an indirect interaction of dLsm11 with histone pre-mRNA. Finally, dLsm10 and dLsm11 can assemble into U7 snRNPs in mammalian cells. All these data indicate that, despite considerable differences in primary sequences, the overall architecture of the U7 snRNP and the mechanism of histone RNA 3′ end processing have been conserved over at least ~1000 Myr, that is, the evolutionary time separating arthropods and vertebrates.
RESULTS
Detection of potential invertebrate orthologs for Lsm10 and Lsm11
Our Lsm10 database (http://www.izb.unibe.ch/res/schuehome/schuemperli/Lsm10.html) currently comprises six complete mammalian sequences (human, mouse, rat, pig, bovine, and dog), as well as two amphibian (Xenopus laevis, Silurana tropicalis), one avian (chicken), and four fish sequences (fugu, medaka, rainbow trout, and zebrafish). By using the lower vertebrate sequences, we could identify potential orthologs from two tunicate species (Ciona intestinalis and Boltenia villosa). Additional iterative BLAST searches revealed a partial sequence from the mosquito Anopheles gambiae and a potential ortholog in Drosophila melanogaster. It is evident from the alignment of these invertebrate Lsm10 sequences with representative vertebrate ones (Fig. 1) that the vertebrate Lsm10 proteins are well conserved, but the sequences found in invertebrates are so divergent that the identification of the corresponding proteins as true Lsm10 orthologs, in the absence of biochemical evidence, remains questionable. However, it is also noteworthy that many of the amino acid changes are conservative in nature.
By using the same iterative search strategy, we found putative invertebrate Lsm11 orthologs in C. intestinalis and D. melanogaster (Fig. 2). Their detection was facilitated by the presence of a long N-terminal extension and of an extensive spacer separating the two Sm motifs in Lsm11 (Pillai et al. 2003). The spacer is poorly conserved in sequence or length, but it is always longer than in any of the other Sm/Lsm proteins. Moreover, the N-terminal extension contains several conserved sequence patches that can also be recognized in the invertebrate proteins (Fig. 2). An Lsm11 database containing additional sequences is maintained at our Web site (http://www.izb.unibe.ch/res/schuehome/schuemperli/Lsm11.html).
Drosophila Lsm10 and Lsm11 interact with each other and with Sm proteins
To analyze whether the putative Drosophila orthologs dLsm10 and dLsm11 associate with each other and with Sm protein-containing complexes, we cloned the corresponding open reading frames (ORFs), each containing an HA-tag at the N terminus, into an expression vector under the control of the Drosophila metallothionein promoter (see Materials and Methods). The HA-dLsm11 protein was well expressed after transient transfection in Drosophila S2 cells (Fig. 3A,B, input lanes). However, the expression of HA-dLsm10 was much lower, so the protein was sometimes barely detectable by immunoblotting crude nuclear extract with anti-HA antibody (see input lane in Fig. 3B). However, importantly, HA-dLsm10 expressed in S2 cells could be coprecipitated by a polyclonal antibody raised against bacterially expressed, GST-tagged dLsm11 (Fig. 3A, lower panel). A control experiment showed that the same antibody also precipitated HA-dLsm11 (Fig. 3A, upper panel). Moreover, both proteins could be coprecipitated by the monoclonal anti-Sm antibody Y12 (Fig. 3B). These results indicated that dLsm10 and dLsm11 interact with each other as well as with some of the standard Sm proteins in Drosophila S2 cells.
The reactivity of these antibodies was tested by Western blots. By using anti-dLsm11 antibody, we could not detect a signal for the endogenous protein in S2 cells, presumably because of low expression of dLsm11 or insufficient antibody activity (Fig. 3C, left panel). Similarly, endogenous Lsm11 from human tissue culture cells could previously not be detected by immunoblotting with an antibody raised against human Lsm11, unless U7 snRNPs had been enriched by biochemical fractionation or by immunoprecipitation (Pillai et al. 2003). However, a signal was detectable with the anti-dLsm11 antibody in Drosophila S2 cells stably expressing HA-dLsm11 from the metallothionein promoter, and this signal comigrated on gels with the band detected by anti-HA antibody (Fig. 3C). Taken together, these results proved that the anti-dLsm11 antibody reacts specifically with dLsm11.
Anti-Sm antibodies, in vertebrates, are known to react with the Sm proteins B/B′, D1, and D3 (Lehmeier et al. 1990), which contain symmetrical dimethyl arginine modifications on their RG-rich C-terminal tails (Brahms et al. 2000, 2001). However, by using the Y12 antibody, we could only detect the B/B′ and D1 proteins, not D3, on Western blots of nuclear extracts from human 293-T cells (Fig. 3D) or other mammalian cell lines (data not shown). Probing a Drosophila S2 cell nuclear extract on the same blot revealed two similar bands, one migrating slightly faster than human B/B′ and the other somewhat slower than human D1. This is in good agreement with the known size differences of the corresponding proteins: Human Sm B/B′ has a molecular mass of ~24 kD (but actually migrates as a ~29-kD protein), whereas the corresponding mass for Drosophila Sm B is ~22 kD; the human and Drosophila D1 proteins are ~13 and ~14 kD in mass, respectively. Thus, the Y12 antibody, in S2 cells, appears to react primarily with the Drosophila Sm B and D1 proteins.
Drosophila Lsm10 and Lsm11 assemble into complexes that contain Sm B but are lacking Sm D1 and D2
Although the above experiments strongly indicated that dLsm10 and dLsm11 are part of a snRNP complex containing at least some of the common Sm proteins, we wanted to obtain more specific information regarding the composition of this complex. In particular, we have shown that mammalian Lsm10 and Lsm11 replace Sm D1 and D2 in the unique U7 Sm core structure (Pillai et al. 2001, 2003). We postulated that, if dLsm10 and dLsm11 are true orthologs of the corresponding mammalian proteins and if the U7 Sm core structure has been conserved between vertebrates and arthropods, the two proteins should also be in a complex containing Sm B but lacking D1 and D2 in Drosophila S2 cells.
This hypothesis was tested by using additional expression constructs encoding Flag-tagged versions of the Drosophila Sm B and D2 ORFs. Flag-dB was well expressed after transfection of S2 cells (Fig. 4A,C,E, input lanes), whereas the expression of Flag-dD2 was low and sometimes barely detectable in whole-cell extract (Fig. 4B,D,F, input lanes). More importantly, both Flag-dB and Flag-dD2 were efficiently precipitated by Y12 antibody, indicating that they were incorporated into snRNPs. In contrast, the anti-dLsm11 antibody precipitated only Flag-dB, not Flag-dD2 (Fig. 4A,B, upper panels). Reprobing of the blots with Y12 antibody revealed both Sm B and D1 in the Y12 precipitates, but only Sm B, not D1, in the anti-dLsm11 precipitates (Fig. 4A,B, lower panels).
Even though these results were in agreement with the above hypothesis, we tried to corroborate them by additional experiments, especially because the lack of an Sm D1 signal in the Y12 immunoblots could also have been due to the generally low recovery of proteins in the anti-dLsm11 immunoprecipitations. Thus, combinations of HA-tagged dLsm10 or dLsm11 with Flag-tagged Sm dB or dD2 were cotransfected into S2 cells; whole-cell extracts were subjected to immunoprecipitation with anti-HA antibody and probed with either anti-Flag, anti-HA or Y12 anti-Sm antibody. In cells cotransfected with Flag-dB in combination with either HA-dLsm11 (Fig. 4C) or HA-dLsm10 (Fig. 4E), both Flag-dB and the HA-tagged Lsm protein could be revealed in the anti-HA immunoprecipitates. In contrast Flag-dD2 could not be coprecipitated with either HA-dLsm11 (Fig. 4D) or HA-dLsm10 (Fig. 4F), although the corresponding HA-tagged Lsm proteins were efficiently precipitated by the anti-HA antibody. This indicated that the complex containing Drosophila Lsm10 and Lsm11 proteins contains Sm B, but not D2, as would be predicted, in analogy with the composition of mammalian U7 snRNPs. Furthermore, reprobing the same blots or blots prepared from identical cotransfection/immunoprecipitation experiments with Y12 antibody revealed the coprecipitation of endogenous Drosophila Sm B, but not D1, by the anti-HA antibody (Fig. 4C–F, bottom panels), indicating that the complex containing Drosophila Lsm10 and Lsm11 proteins does not contain Sm D1. Note, that more Sm dB was precipitated by anti-HA than by anti-dLsm11 antibody (Fig. 4A,B), presumably because of the higher expression of the HA-tagged dLsm10 and dLsm11 proteins and/or because of more efficient immunoprecipitation by the anti-HA antibody.
Drosophila Lsm10 and Lsm11 associate with U7 snRNA and histone pre-mRNA in S2 cells
Having established that dLsm10 and dLsm11 show the same pattern of associations with Sm proteins as do their counterparts in the mammalian U7 snRNP, we wanted to analyze if they also associate with Drosophila U7 snRNA in S2 cells. This was facilitated by the recent determination of the Drosophila U7 snRNA sequence (Dominski et al. 2003). We designed primers allowing the detection of Drosophila U7 snRNA by reverse transcription-coupled PCR (RT-PCR). The expected RT-PCR signal of 88 nucleotides was produced by using RNA isolated from Drosophila S2 cell nuclear extract as template (Fig. 5A, lane 1). The RNA responsible for this band was immunoprecipitated with anti-dLsm11 (lane 3) and Y12 antibodies (lane 4), but it could not be precipitated by protein G–Sepharose beads without antibodies (lane 2). Moreover, in S2 cells stably expressing HA-dLsm11, U7 snRNA could be immunoprecipitated with anti-HA antibody (lane 6), whereas anti-HA precipitation of extract from cells not expressing HA-dLsm11 did not coprecipitate U7 snRNA (lane 5). Thus, Drosophila U7 snRNA is associated with Sm proteins and, more specifically, with dLsm11 in vivo.
Next, we analyzed whether precipitating the U7 snRNP with anti-dLsm11 antibody also coprecipitated histone pre-mRNA, as would be expected for a particle involved in histone RNA 3′ end processing. Thus, samples of the RNAs already used in Figure 5A were analyzed by RT-PCR assays for histone H3 pre-mRNA and, as a negative control, for β-actin pre-mRNA. The results showed that histone H3 pre-mRNA could be precipitated by both anti-dLsm11 (Fig. 5B, lane 3) and Y12 antibodies (lane 4), but not by protein G–Sepharose beads alone (lane 2). Moreover, similar to U7 snRNA shown above, H3 pre-mRNA was also enriched by immunoprecipitation with anti-HA antibody from nuclear extracts of S2 cells stably expressing HA-dLsm11 (lane 6). This was due to a specific association of H3 pre-mRNA with a complex containing HA-dLsm11, because the anti-HA antibody did not precipitate H3 pre-mRNA from nuclear extract of S2 cells not expressing the HA-tagged protein (lane 5). As expected and in contrast to H3 pre-mRNA, β-actin pre-mRNA could only be detected in the input nuclear extract and in the Y12 immunoprecipitation (presumably due to its association with spliceosomal snRNPs), but not in any of the other immunoprecipitates (Fig. 5C). These experiments indicated that the complex containing Drosophila Lsm11 protein is also, albeit temporarily, associated with histone H3 pre-mRNA.
Assembly of Drosophila Lsm10 and Lsm11 into mammalian U7 snRNPs
As the above experiments indicated that the composition of the U7 Sm core is conserved between vertebrates and arthropods, we tested whether dLsm10 and dLsm11 could be assembled into U7 snRNPs in mammalian cells and whether, by doing so, they could discriminate between U7 and U1 snRNA. For this purpose, we transiently co-expressed HA-dLsm10 and HA-dLsm11 under the control of the human cytomegalovirus promoter in human 293-T cells. U7 or U1 snRNPs were precipitated from nuclear extracts by using biotinylated oligonucleotides complementary to the 5′ ends of the corresponding snRNAs. The HA-tagged proteins were then detected in the precipitates by Western blotting with anti-HA antibody. Indeed, both HA-dLsm10 and HA-dLsm11 associated with U7 snRNA, but similar amounts of the two proteins were also precipitated by the anti-U1 oligonucleotide (Fig. 6, upper panel). However, when comparing these band intensities, one must consider that the cells contain ~100 times more U1 than U7 snRNPs (Grimm et al. 1993) and that different amounts of snRNPs will therefore be precipitated by the two antisense oligonucleotides. This was confirmed by reprobing the Western blot with Y12 anti-Sm antibody, which detects the Sm B/B′ protein present in both the U7 and U1 snRNPs (Fig. 6, lower panel). Indeed, the Sm B/B′ signal was much stronger in the precipitate obtained with U1 oligonucleotide. This implied that dLsm10 and dLsm11 preferentially bound to U7 snRNA. Taken together, these results indicated that the protein–protein and protein–RNA interactions involved in the assembly of the U7 Sm core must be evolutionarily conserved between arthropods and vertebrates.
DISCUSSION
The central role of the U7 snRNP for histone RNA 3′ end processing and its importance for studying Sm core assembly, nuclear import of snRNPs, as well as Cajal body formation and function (see Introduction) motivated us to characterize this particle in Drosophila, an organism that is amenable to various types of investigation (classical and reverse genetics, RNAi, biochemistry, developmental studies). Moreover, through the study of evolutionarily distant orthologs, we hoped to identify conserved amino acid residues that may provide important cues for more detailed functional studies. Finally, as mentioned in the Introduction, evolutionary aspects of histone RNA processing were also of considerable interest in their own right. The fact that orthologs of Lsm10 and Lsm11, ZFP100 and, until recently, U7 snRNA had not been identified in lower invertebrates, allowed the alternative hypothesis that histone RNA processing, despite the presence of the conserved hairpin and HBP/SLBP, might operate by a mechanism independent of a U7 snRNP in these organisms. Initial difficulties to identify Lsm10 orthologs in invertebrates (Pillai et al. 2001) also raised the possibility that only Lsm11 but not Lsm10 might be present.
Our identification of putative Lsm10 and Lsm11 sequences in various invertebrates (Figs. 1, 2) and the characterization of the corresponding Drosophila proteins as true orthologs of their vertebrate counterparts contradict both of these alternative views. In Drosophila S2 cells, dLsm10 and dLsm11 associate with each other and with Sm B, as well as with Drosophila U7 snRNA (Figs. 3–5). Conversely, Sm dD1 and dD2 are not found associated with either dLsm10 or dLsm11. This pattern of associations is identical to the one found in vertebrates. Thus, although we have not analyzed the other Sm proteins, we postulate that the composition of the U7 Sm core structure is identical in arthropods and in vertebrates. Moreover, despite the fact that a Drosophila ZFP100 ortholog has not yet been identified, the basic constituents and hence also the mechanism of histone RNA processing appear to be the same in both types of organisms. Extrapolating from these findings, it seems likely that the key elements of the histone RNA 3′ processing machinery will also be conserved in the other metazoans and in the additional organisms that have nonpolydenylated histone mRNAs ending in the conserved hairpin (Wittop Koning and Schümperli 1994) or for which a putative HBP/SLBP ortholog can be found in sequence databases.
This similarity in the composition of the U7 snRNP and the histone RNA processing machinery contrasts with a high degree of primary sequence divergence of individual components. The recently characterized Drosophila U7 snRNA (Dominski et al. 2003) is so different from its known vertebrate and sea urchin counterparts that it could not be discovered by a bioinformatic approach. The same is true for the Lsm10 protein, for which we initially could not find any invertebrate orthologs (Pillai et al. 2001). The sequences of the potential tunicate and arthropod orthologs are so different that only the biochemical experiments presented in Figures 3–5 could confirm the identification of the Drosophila protein as Lsm10.
The situation is somewhat different for the large Lsm11 protein (Fig. 2). The Drosophila and Ciona orthologs were quite readily discovered by bioinformatic searching. Apart from the characteristic Sm motifs, two prominent features of Lsm11 are conserved in all potential orthologs: The first Sm motif is preceded by a large N-terminal extension, and the two Sm motifs are separated by a long spacer. This spacer is not conserved in primary sequence, but the fact that it is present in all putative orthologs suggests that it may perhaps play a role that is not sequence specific. Consistent with the possibilities both of a lack of function or of a sequence-unspecific function, we have previously shown that more than half of this spacer can be deleted from murine Lsm11 without affecting either U7 snRNP assembly or histone RNA processing (Pillai et al. 2003). In contrast, the phylogenetic comparison with invertebrate sequences reveals that the N terminus of Lsm11 contains four conserved sequence patches. Secondary structure modeling by the PHD program suite (Rost and Sander 1994,Rost and Sander 2000) indicates a high propensity of these sequences for α-helix formation, whereas the remainder of the N terminus does not contain any predicted secondary structure elements. It goes without saying that these conserved sequence patches are primary candidates for being functionally involved in histone RNA processing and in the binding of ZFP100 or of additional, yet unknown processing factors.
The coprecipitation of Drosophila U7 snRNA and histone H3 pre-mRNA with anti-dLsm11 antibody (Fig. 5) confirms that the characterized particle indeed is the Drosophila U7 snRNP. Moreover, Drosophila U7 snRNA is also coprecipitated by anti-HA antibody in S2 cells stably expressing HA-dLsm11, but not in S2 cells that do not express any HA-tagged protein. This proves that HA-dLsm11 indeed gets incorporated into U7 snRNPs and validates the results from the association studies carried out with tagged proteins. Although we do not yet know if the U7 snRNPs containing HA-dLsm11 are functional in histone RNA processing, they are able to interact with histone H3 pre-mRNA, just like U7 snRNPs containing the endogenous, untagged dLsm11, as judged by the coprecipitation of H3 pre-mRNA (Fig. 5). Thus the tagged protein may in the future be useful for functional complementation studies, either in flies carrying a mutation in the dLsm11 gene or in S2 cells in which the dLsm11 gene has been silenced by RNA interference.
The ability of HA-dLsm10 and HA-dLsm11 to get incorporated into U7 snRNPs in mammalian cells (Fig. 6) indicates that despite considerable differences at the primary sequence level, the protein–protein and protein–RNA interactions involved in U7 snRNP assembly must be evolutionarily conserved. However, initial experiments in which the two Drosophila Lsm proteins were transfected individually resulted in a less efficient incorporation into U7 snRNPs and, for HA-dLsm10, also in a poorer expression (data not shown). As Sm core assembly proceeds through specific Sm protein heterodimers and heterotrimers (Raker et al. 1996, 1999) and as Lsm10 and Lsm11 are presumably part of the same heterodimer (Pillai et al. 2003), the poorer performance of individual Drosophila Lsm proteins is most likely due to the fact that they must form an Lsm10/11 heterodimer with a partner from a different species.
Although the cotransfected HA-dLsm10 and HA-dLsm11 showed a preference for U7 over U1 snRNA in mammalian cells (Fig. 6), it must be noted that this discrimination was less pronounced than when mammalian Lsm10 or Lsm11 proteins were previously tested in the same kind of assay (Pillai et al. 2001, 2003). The reason for this reduced selectivity is unknown, but this phenomenon deserves further investigation, as such studies may contribute to understanding how the U7-specific and spliceosomal Sm core structures are specified. Moreover, it will be interesting to see if U7 snRNPs formed with Drosophila Lsm10/11 in a vertebrate context are functional in histone RNA 3′ processing.
In conclusion, these studies have revealed that the special architecture of the U7 snRNP is evolutionarily conserved, and have opened the possibility to study the assembly and function of this interesting particle in the Drosophila system that is equally amenable to classical and reverse genetics as well as to biochemical investigation.
MATERIALS AND METHODS
Plasmids
Original clones were obtained through the Berkeley Drosophila Genome Project Database (BDGP). The dLsm11 (EST BG640322), dLsm10 (genomic bacmid BAC R13J10), dSmB (EST LD14049), and dSmD2 (EST LD03002) ORFs were amplified by PCR and subcloned for expression in Drosophila cells into modified pMTv/A expression vectors (Invitrogen). These pMTv-HA and pMTv-Flag vectors contained N-terminal HA (MAYPYDVPDY ASLE) and Flag (DYKDDDDK) epitope tag-encoding sequences, respectively, introduced as annealed DNA oligonucleotides.
For expression in mammalian cells as N-terminal HA tag fusions, the dLsm10 and dLsm11 ORFs were introduced into the pcDNA3-HA vector (Pillai et al. 2001). For expression in Escherichia coli as N-terminal GST fusion, the dLsm11 ORF was subcloned into pGex4T-3 (Pharmacia). All clones were verified by DNA sequencing. Details of the constructs are available on request.
Antibodies
To produce the antibody referred to as anti-dLsm11, insoluble GST-dLsm11 protein was recovered from the pellet fraction of the producing Escherichia coli strain BL21 (Pharmacia) and resolved by SDS-PAGE, and a gel fragment mixed with adjuvants was used for immunization of rabbits. Antibodies were affinity-purified by overnight incubation with recombinant proteins blotted on a nitrocellulose membrane, essentially as described (Pillai et al. 2001).
Additionally, monoclonal antibodies against Sm proteins B/B′, D1, and D3 (Y12; Lerner et al. 1981) or against the HA tag (BABCO) were used. The HA and Flag epitope tags were detected in Western blots using anti-HA (Roche) and anti-Flag (Sigma) antibodies directly coupled to horseradish peroxidase.
Cell culture and transfections
Human 293-T cells were cultured as described (Pillai et al. 2001). For transfection, they were grown in 10-cm dishes to 50%–60% confluency and transfected with a mixture of 5 μg each of pcDNA3-HA-dLsm10 and pcDNA3-HA-dLsm11 using Lipofectamine (Invitrogen). Cells were harvested 48 h after transfection.
Drosophila Schneider S2 cells were cultured in Schneider’s Drosophila medium with L-glutamine (AMIMED), supplemented with 10% heat-inactivated fetal calf serum, 100 U/mL penicillin G, and 100 μg/mL streptomycin (AMIMED) at 26°C (without CO2). For transient transfection, 106 cells were seeded in 10-cm dishes, grown to ~70% confluency, and transfected with 10 μg (or mixtures of 2 × 5 μg) of pMTv-derived plasmids by using CellFectin (Invitrogen). Expression of proteins from the metallothionein promoter was induced at day 2 by the addition of 500 μM copper sulfate to the medium, and cultures were continued for another 4 d.
For stable transfection, S2 cells were cotransfected with pMTv-HA-dLsm11 and the pCoHygro selection vector (Invitrogen) in a 19:1 ratio (4 μg/well of a six-well plate), according to the manufacturer’s instructions. Five days later, the cells were harvested and resuspended in Schneider’s medium, supplemented as above but additionally containing 400μg/mL hygromycin B, and grown for 3 to 4 weeks. Stably transfected S2 cells were split when the cell density reached between 2 and 6 × 106 cells/mL (Hill et al. 2001).
Extract preparation and protein interaction studies
The preparation of small-scale nuclear or whole-cell extracts, immunoprecipitations, and precipitations with biotinylated oligonucleotides complementary to the 5′ ends of U7 or U1 snRNA were performed as described (Pillai et al. 2001).
Proteins were resolved on 12% high-TEMED SDS–polyacrylamide gels (Will et al. 1994). Western blots were probed with Y12 anti-Sm (Lerner et al. 1981), anti-HA-HRP, or anti-Flag-HRP antibodies and developed by the ECL method (Amersham).
Coprecipitation and detection of Drosophila U7 snRNA as well as histone H3 and β-actin pre-mRNAs
Large scale nuclear extracts from Drosophila S2 cells were prepared as described. Total RNAs were isolated from immunoprecipitates obtained with different antibodies (anti-dLsm11, Y12, or anti-HA) from 1 mL samples of S2 wild-type nuclear extract (large scale; Dignam et al. 1983; Pillai et al. 2001) or from 400 μL of nuclear extract from the S2 cell line stably expressing HA-dLsm11 (small scale; Pillai et al. 2001). The RNAs were extracted at 65°C with a buffer containing 10 mM Tris-HCl (pH 8), 5 mM EDTA, 0.5% SDS, 1 mg proteinase K, and 4 μg glycogen.
Drosophila U7 snRNA, histone H3 pre-mRNA, and β-actin pre-mRNA were amplified by RT-PCR using the rTth kit (Perkin Elmer) and α-32P-dCTP, essentially as described (Suter et al. 1999). The primers and conditions used are listed in Table 1. RT and PCR conditions were selected for each primer pair and are available on request.
Primers and amplicon lengths for RT-PCR
Sequence alignment of Lsm10 proteins. Additional sequences, accession numbers, and links are available at our online database (http://www.izb.unibe.ch/res/schuehome/schuemperli/Lsm10.html). The sequences are from the following species: Hs, Homo sapiens, human (Unigene Hs.3496); Mm, Mus musculus, house mouse (Unigene Mm.41865); Xl, Xenopus laevis (Unigene Xl.6554); Gg, Gallus gallus, chicken (ESTs BU211735, BU445159, BU461367); Om, Oncorhynchus mykiss, rainbow trout (EST CA368590); Ci, Ciona intestinalis, transparent sea squirt (ascidian tunicate; Unigene Cin.8743); Bv, Boltenia villosa (ascidian tunicate; protein AAM76159); Dm, Drosophila melanogaster (FlyBase CG12938); and Ag, Anopheles gambiae, African malaria mosquito (predicted partial ORF EAA01310). Residues identical to the human protein are shown in inverse print. The conserved Sm motifs 1 and 2 are indicated by shaded boxes. The consensus sequences were adapted from previous sources (Hermann et al. 1995; Achsel et al. 2001). h indicates hydrophobic amino acids.
Sequence alignment of Lsm11 proteins. Additional sequences, accession numbers, and links are available at our online database (http://www.izb.unibe.ch/res/schuehome/schuemperli/Lsm11.html). The sequences are from the following species: Hs, Homo sapiens, human (UniGene Hs.187117); Mm, Mus musculus, house mouse (UniGene Mm.45659); Xl, Xenopus laevis (GenBank AF514310, UniGene Xl.13277); Tr, Takefugu rubripes, pufferfish (scaffold 544, positions 60′000–63′000); Ci, Ciona intestinalis, transparent sea squirt (ascidian tunicate; Unigene Cin.4879); and Dm, Drosophila melanogaster (Flybase CG12924). Residues identical to the human protein are shown in inverse print. Dark grey bars indicate conserved sequence patches in the N-terminal region. The conserved Sm motifs 1 and 2 are indicated by shaded boxes. The consensus sequences were adapted from previous sources (Hermann et al. 1995; Achsel et al. 2001). h indicates hydrophobic amino acids.
In vivo association of Drosophila Lsm10 and Lsm11 with each other and with Sm proteins. (A) Immunoprecipitation by an affinity-purified antibody raised against recombinant GST-tagged dLsm11 (α-dLsm11). Drosophila S2 cells were transiently transfected with expression plasmids encoding HA-tagged dLsm11 (top) or dLsm10 (bottom), and whole-cell extracts were subjected to immunoprecipitation, SDS-PAGE, and Western blotting with anti-HA antibody. Input indicates sample of nuclear extract prior to immunoprecipitation; beads, control precipitation by protein G–Sepharose beads without antibody. Asterisks (*) indicate bands of slower electrophoretic mobility that react with the anti-HA antibody subsequent to immunoprecipitation; it is not known whether these represent potentially modified HA-dLsm11 protein (the expected reacting band is indicated by an arrow). (B) Immunoprecipitation of nuclear extracts from similar transfections by monoclonal anti-Sm antibody Y12. Lane labels and symbols are used as in A. (C) Reactivity of the anti-dLsm11 antibody used in A. Duplicate samples of whole-cell extracts from untransfected Drosophila S2 cells or cells stably expressing HA-dLsm11 were subjected to SDS-PAGE and blotted onto the same filter. The two filter halves were probed with either anti-dLsm11 or anti-HA antibodies as indicated below the panels. Both antibodies react with a protein of identical electrophoretic mobility representing HA-dLsm11 (arrow). (D) Reactivity of the anti-Sm antibody Y12 used in B Nuclear extracts from human 293-T cells and from Drosophila S2 cells were subjected to SDS-PAGE and Western blotting with Y12 antibody. The antibody reacts primarily with human Sm B/B′ and D1 and with their corresponding Drosophila orthologs.
In vivo association of Drosophila Lsm10 and Lsm11 with Sm B, but not with Sm D1 and D2. Drosophila S2 cells were transiently transfected with expression plasmids encoding Flag-tagged Sm proteins dB (A, C, E) or dD2 (B, D, F) either alone (A, B) or in combination with HA-dLsm11 (C, D) or HA-dLsm10 (E, F). Whole-cell extracts were subjected to immunoprecipitation with the antibodies indicated above the lanes. The samples were subjected to SDS-PAGE, and Western blots were probed by using the antibodies indicated on the left of the figure. Input indicates sample of nuclear extract prior to immunoprecipitation; beads, control precipitation by protein G–Sepharose beads without antibody. A band corresponding to protein G released from the beads is visible in all the Y12 blots, just above the band specific for Sm dB. The black separation line in the bottom panel of F indicates that the input sample was run in a different position on this gel and had to be remounted.
Association of Drosophila Lsm11 with Drosophila U7 snRNA and histone H3 pre-mRNA. Nuclear extracts of either untransfected Drosophila S2 cells (lanes 2–5) or S2 cells stably expressing HA-dLsm11 (lane 6) were subjected to precipitation with protein G–Sepharose beads either lacking antibody (lane 2) or coated with the antibodies indicated above each lane. Lane 1, sample of input nuclear extract from untransfected S2 cells. RNA from each sample was extracted and subjected to RT-PCR for Drosophila U7 snRNA (A), histone H3 pre-mRNA (B), or β-actin pre-mRNA (C). M indicates end-labeled HpaII digest of pBR322 used as size marker.
Association of Drosophila Lsm10 and Lsm11 with U7 snRNA in mammalian cells. Human 293-T cells were cotransfected with plasmids encoding HA-dLsm10 and HA-dLsm11. Nuclear extracts were subjected to precipitation with magnetic streptavidin beads coated with biotinylated oligonucleotides complementary to human U7 or U1 snRNAs as indicated. The pellets were subjected to SDS-PAGE, and a Western blot was probed with anti-HA or Y12 antibody (revealing Sm B/B′), as indicated on the left of the figure. Input indicates sample of original nuclear extract; beads, precipitation with beads not coated with oligonucleotide.
Acknowledgments
We thank O. Mühlemann and B. Suter for critical comments, as well as K. Schranz and A. Gruber for technical help. This work was supported by the State of Bern and by Swiss National Science Foundation grant 31-65225.01 to D.S.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 USC section 1734 solely to indicate this fact.
Footnotes
-
Article and publication are at http://www.rnajournal.org/cgi/doi/10.1261/rna.5143303.
-
- Accepted September 4, 2003.
- Received July 28, 2003.
- Copyright 2003 by RNA Society

















