|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
BIOINFORMATICS |
1 Biomedical Sciences Graduate Program, 2 San Diego Supercomputer Center, and 3 Department of Cellular and Molecular Medicine, University of CaliforniaSan Diego, La Jolla, California, 92093, USA
4 Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, USA
Reprint requests to: Xiang-Dong Fu, Biomedical Sciences Graduate Program and Department of Cellular and Molecular Medicine, University of CaliforniaSan Diego, La Jolla, CA 92093, USA; e-mail: xdfu{at}ucsd.edu; fax: (858) 534-8549; or Michael Gribskov, Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA; e-mail: gribskov{at}purdue.edu; fax: (765) 496-1189.
| ABSTRACT |
|---|
|
|
|---|
Keywords: MAASE database; constitutive splicing; alternative splicing; regulatory motifs; transposable elements; evolution of alternative splicing
| INTRODUCTION |
|---|
|
|
|---|
Previous studies have detected a number of differences between alternatively and constitutively spliced exons. For example, while the sequences of constitutive splice sites more closely resemble that of the consensus splice site sequence than do those of alternative splice sites, alternatively spliced exons are shorter in length, have a higher degree of sequence conservation between orthologous exons, and maintain transcript reading frame at a higher frequency than do constitutively spliced exons (Senapathy et al. 1990
; Stamm et al. 2000
; Clark and Thanaraj 2002
; Sorek and Ast 2003
; Thanaraj and Stamm 2003
; Itoh et al. 2004
; Sorek et al. 2004b
; Sugnet et al. 2004
). However, the focus in many of these studies has been on skipped exons and/or one species at a time.
The emphasis on skipped exons is possibly because of the ease in the identification of skipped exons by aligning mRNA/ EST sequences with genomic sequence. Although several alternative splicing database efforts considered different modes of splicing, those analyses suffered from either limited database content or insufficient quality control on the computationally derived information. We have constructed the MAASE database system to support our splicing microarray efforts (Zheng et al. 2004
). The MAASE database system consists of a manual/ computational tool for annotating alternative splicing events and a database of highly curated information annotated. In constructing MAASE, we compiled detailed information (i.e., mode of splicing) to assist with microarray oligonucleotide design, as well as to better relate experimental results to specific splicing pathways. In addition, the MAASE database system includes curated information for both human and mouse, thus providing a unique opportunity to systematically characterize different modes of alternative splicing.
In this study, we report features that distinguish alternative and constitutive splicing, and more importantly, features that distinguish distinct modes of alternative splicing. Our results confirm many previously reported differences between alternative and constitutive splicing. When these differences were further studied, we find many of these features also distinguish between different modes of splicing. Furthermore, we detect, for the first time, an association of multiple classes of transposable elements with alternatively spliced regions, suggesting a contribution of these repetitive elements to the origin of alternative splicing. Together, these results will facilitate the development of predictive tools for alternative splicing and uncover potential regulatory mechanisms for different modes of alternative splicing.
Data set
Data sets were retrieved from the Manually Annotated Alternatively Spliced Events (MAASE) Database (Zheng et al. 2004
; Zheng et al. 2005
). We use the following nomenclature and abbreviations for different splicing modes and their associated intron sequences. Constitutively Spliced exons (CS) are internal exons, which are supported by four or more annotated transcripts in MAASE, and which show no evidence of alternative splicing in MAASE. It is possible that this class may mistakenly include some alternative exons found in mRNAs or ESTs that are found in GenBank or other general databases, but were not selected for annotation in MAASE. Skipped Exons (ES) are those included in one transcript but skipped in another. Retained Introns (IN) are sequences included as a part of an exon in one transcript and excluded as an entire intron in another. Alternative Acceptor exons (AA) have competing 3' splice sites with the same downstream exon end. Alternative Donor exons (AD) have competing 5' splice sites with the same upstream exon start. It is important to note that the stringent definition of AA and AD exclude alternative 3' splice site choice paired with alternative polyadenylation and alternative 5' splice site choice linked with alternative promoters. This enabled us to focus our statistical analysis on individual splicing modes without confounding effects from other coupling events.
CS introns are sequences that lie between two constitutive exons. ES introns are sequences flanking skipped exons. AA introns are sequences upstream of the alternative 3' splice sites, whereas AD introns are those downstream of the alternative 5' splice sites. The combined data set retrieved from MAASE comprises 2641 and 1967 CS, 468 and 340 ES, 246 and 161 IN, 310 and 282 AA, and 176 and 164 AD from human and mouse, respectively. Alternatively Spliced exons (AS) refer to the combination of all modes of alternative splicing within our set. For simplicity, we did not include complex modes of alternative splicing (Zheng et al. 2005
) from the MAASE database.
| RESULTS |
|---|
|
|
|---|
Distinct sequence and structural features between CS and AS
Exon and intron length
It is well known that splice sites bracketing an exon are recognized by an "exon definition" mechanism in mammalian cells (Berget 1995
). This mechanism implies the potential effect of exon length on splice site recognition across the exon. If an exon is too short, steric hindrance may interfere with the interaction between the two ends. On the other hand, a long exon has a higher probability of harboring negative elements. Thus, splice site selection can be influenced by exon length.
The constraint on exon length is clearly evident by the similar lengths in CS between human and mouse, which have median lengths of 120 and 122 nt, respectively (Table 1
). In contrast, ES have median lengths of 101 and 88 nt, while IN have median lengths of 145 and 179 nt, in human and mouse, respectively. These results agree with previous reports that ES are significantly shorter than CS and that IN are significantly longer than CS (Stamm et al. 2000
; Thanaraj and Stamm 2003
; Galante et al. 2004
). The constitutive portion of AA and AD are similar in length to CS, whereas the alternative portion of both AA and AD are significantly shorter. Therefore, when the constitutive and alternative portions of AA and AD are combined, they are significantly longer than CS. It is interesting to note that all exons or exonic regions involved in alternative splicing show greater length variability than do CS (Table 1
). The high degree of variation may reflect a combination of multiple influences on splicing regulation.
|
|
|
|
|
It has been widely assumed that the stronger a splice site, the more often it is used. This observation forms a basis for computational methods to predict splicing regulatory elements (Fairbrother et al. 2002
). To test whether this is indeed the case, we determined the number of ESTs corresponding to the stronger (more conserved) or the weaker (less conserved) competing splice sites in AD and AA and find a positive correlation between the splice site strength and splice site selection in both human and mouse. In human, 63% of AA junctions and 58% of AD junctions show a positive correlation, with the remaining percentage of splicing events showing a negative correlation (perhaps due to other regulator mechanisms that are dominant over the splice site strength). In mouse, 61% for AA junctions and 55% for AD junctions shows the positive correlation. These findings demonstrate that stronger splice sites are preferred in >50% of the cases, thus supporting the contribution of splice site strength to splice site selection in gene expression.
For ES and IN, we could not use the same approach as above because the two alternative splice sites in these two modes are used as a unit. Therefore, we had to investigate the correlation between the combined strength of splice sites and EST frequency. We separated each splicing event into two classes as follows: one showing a higher EST frequency when the alternative splice sites are used rather than skipped (the "more retained" group), and the other with a higher EST frequency when the alternative splice sites are skipped rather than used (the "less retained" group). The splice site strengths within the two groups can then be compared to ask whether splice site strength is associated with splice site selection. Using this strategy, we found that, for ES, the "more retained" group has a median splice site strength significantly stronger than that of the "less retained" group in both human (P <8.8 x 106) and mouse (P < 6.6 x 104), thus suggesting that splice site strength and exon inclusion are positively correlated. For IN, the "less retained" has a median splice site strength significantly stronger than that of the "more retained" group in both human (P < 1.6 x 105) and mouse (P < 1.3 x 102), thus suggesting that splice site strength and intron removal are positively correlated.
Next, we investigated the relationship between exon length and splice site choice. For AD and AA, we counted the number of ESTs corresponding to junctions resulting in either the longer (using the proximal splice site) or shorter (using the distal splice site) spliced exon. Results indicate that the proximal sites (thus longer exons) are preferentially used in both AD and AA. In human, 66% of AA junctions and 70% of AD junctions correspond to the use of proximal sites. In mouse, the percentages are 67% for AA junctions and 65% for AD junctions. This bias likely reflects the so-called "proximal rule", which states that the proximal splice site will be preferentially selected when the two competing sites have equal splice site strengths and similar cis-regulatory elements in flanking exons (Reed and Maniatis 1986
; Ibrahim el et al. 2005
). Because naturally occurring competing alternative splice sites are not exactly equal, as experimentally arranged, it is likely that splice site selection is influenced by splice site strength and distinct regulatory mechanisms in addition to the positional effect.
For ES and IN, results show an interesting relationship between exon length and splice site usage. With ES, the median length of "more retained" is significantly longer than that of "less retained" in both human (P < 1.4 x 5) and mouse (P < 1.1 x 2), meaning that the longer the exon, the more likely it is to be included. With IN, on the other hand, the median length of "more retained" is significantly shorter than those in "less retained" in both human (P < 2.6 x 5) and mouse (P < 3.7 x 3). This suggests that shorter introns may be more likely to be included than longer introns. However, these results may also be a reflection of the detection bias for shorter introns in sequencing EST clones, because shorter retained introns are more easily detected than longer retained introns.
Together, these results suggest a strong link between the strength of splice sites, the position of splice sites, and the length of alternative exons on the expression of an alternative splicing event. Similar trends in human and mouse strongly argue for the biological importance of these characteristics.
Sequence motifs implicated in splicing regulation
Because alternative splice sites are weaker than constitutive splice sites, it has been speculated that AS may have different frequencies and/or ratios of exonic splicing enhancers (ESEs) and silencers (ESSs) elements compared with CS. However, most of the studies conducted thus far focus on ES (Wang et al. 2004
; Zhang and Chasin 2004
). To test the generality of this observation in other modes of alternative splicing, we determined the distributions of computationally predicted ESEs and ESSs (Fairbrother et al. 2002
; Wang et al. 2004
; Zhang and Chasin 2004
) onto each exon class. The two sets of ESEs and ESSs from the Burge (RESCUE-ESE and FAS-ESS) and Chasin (PESE and PESS) groups were separately mapped. RESCUE-ESEs are hexamers found to be enriched in exon versus intron sequences and in exon sequences with strong splice sites versus weak splice sites (Fairbrother et al. 2002
). FAS-ESSs are hexamers identified as splicing silencers in a functional assay (Wang et al. 2004
). PESEs and PESSs are octamers found to be enriched and depleted, respectively, in internal noncoding exons versus unspliced pseudo exons and in internal noncoding exons versus 5' untranslated regions of intronless genes (Zhang and Chasin 2004
). Each element was mapped within each exon sequence in each exon class (Fig. 1
). The Wilcoxon rank sum test was then used to compare the distributions of a specific element within CS and an AS class (see Materials and Methods for further details). Comparisons resulting in a P-value
0.05 [and thus (1-p)
0.95] are highlighted in red. Individual hexamers are arranged according to their differential distributions in comparison between CS and ES. Clearly, a set of hexamers is over-represented in ES, while a different set is under-represented in ES (thus, over-represented in CS). Interestingly, when different modes of alternative splicing were compared with CS, different sets of over- and under-represented hexamers were identified, indicating that different modes of alternative splicing may have distinct preferences in selecting positive (ESEs) and negative (ESSs) regulatory elements. The most dramatic case can be seen in the comparison between CS and IN. IN have a dramatically higher frequency of ESSs. IN are, in this sense, more similar to introns. Trends are consistent for both human and mouse and for the two sets of previously predicted ESEs and ESSs.
|
Figure 2
shows the intergenomic comparison of each mode of AS with CS. Using the RSA tools (van Helden et al. 2000
), frequencies of all possible 4096 hexamers were compared with a third-order Markov background model consisting of CS and AS to obtain a Z-score for each hexamer. Z-scores represent the standard deviation of each hexamer from the background model; however, they are not linearly comparable. Therefore, to directly compare the hexamers in different modes of AS to CS, Z-scores were converted to P-values (based on the standard normal curve). Over- and under-represented hexamers in each AS class were identified by comparing the P-values of hexamers from each AS class with the P-values of corresponding hexamers in the CS class (
p = log pCSlog pAS). A positive
p represents a hexamer that is under-represented in an AS class, while a negative
p represents a hexamer that is over-presented in an AS class. The plot shows
p for each AS exon class for human and mouse. In each plot, hexamers in quadrant HM are those under-represented in AS (thus over-represented in CS) in both human and mouse, while hexamers in quadrant H+M+ are those over-represented in a class of AS (thus under-represented in CS) in both human and mouse. The conservation between the two species suggests that the enriched hexamers have a higher potential to be biologically relevant. Hexamers in HM+ and H+M represent species-specific differences. Assuming that regulatory mechanisms are conserved between human and mouse (Yeo et al. 2004
), the number of species-specific hexamers is expected to be low. Quadrants HM+ and HM+ can therefore be used to choose a cut-off for compiling a list of hexamers in quadrants H+M+ and HM that are over-represented and under-represented in each class of AS.
|
Further analysis of these over- and under-represented AS hexamers reveals the presence of previously predicted ESEs and ESSs (Fairbrother et al. 2002
; Wang et al. 2004
; Zhang and Chasin 2004
). Our AS hexamers were directly compared with RESCUE-ESE and FAS-ESS hexamers. Over-represented hexamers, those which are present three or more times within the PESE and PESS octamers, were used to compare with our AS hexamers (Wang et al. 2004
). Results reveal strikingly consistent findings in the frequency and distribution of ESEs and ESSs within different modes (Fig. 2
). However, because there are a higher number of previously predicted ESEs than ESSs, interpretations concerning these ratios need to be carefully considered. Beside the trend in ratios, it is clear that one set of ESEs are selectively enriched in ES compared with CS (H+M+) and a different set of ESEs are selectively depleted in ES compared with CS (HM). In IN, on the other hand, the prominent features are the enrichment of ESSs and the depletion of ESEs. Thus, IN resembles regular introns in terms of the distribution of regulatory motifs. Interestingly, AA and AD seem to have significantly reduced ESEs, suggesting that the depletion of positive regulatory elements may be a key mechanism for these modes of alternative splicing. Together, the observations re-enforce the possibility that different mechanisms are involved in regulating different modes of alternative splicing.
Exonization of multiple types of transposable elements
Mammalian genomes have remarkably high percentages of transposable elements (Lander et al. 2001
; Waterston et al. 2002
). ALU elements within intronic regions have been shown to be capable of becoming exonized, contributing to ~5% of the skipped exons surveyed (Sorek et al. 2002
, 2004a
). There are, however, multiple classes of transposable elements, including short interspersed nuclear elements (SINEs), long interspersed nuclear elements (LINEs), long terminal repeats (LTRs), and DNA transposons (DNAs). The tendency for these elements to become exonized remains unknown. In addition, the contribution of transposable elements to the creation of other modes of alternative splicing has yet to be investigated.
Contribution of SINEs to alternative splicing in both human and mouse
SINEs are 100400 nt retrotransposons, which include ALU elements, and make up ~13% of the human genome and ~8% of the mouse genome (Lander et al. 2001
; Waterston et al. 2002
). Our analysis shows that SINE elements are abundantly present in introns (Fig. 3
). SINE elements are relatively low in abundance near the splice site and become more pronounced farther into the intron. This abundance is seen in both CS and AS junctions. Previous reports indicate that Alu elements in human are also present in exons, but they are only associated with AS (Sorek et al. 2002
; Lev-Maor et al. 2003
). We have extended the analysis to all classes of SINEs and found that they are associated with AS, not CS. Furthermore, when AS were separately analyzed according to splicing modes, we found that only ES are associated with SINE elements.
|
Role of other transposable elements in alternative splicing
To systematically survey the potential involvement of the remaining classes of transposable elements in alternative splicing, we performed a similar analysis on LINEs, LTRs, and DNAs. LINEs are ~6000-nt-long retrotransposons, which make up ~21% of the human genome and ~19% of the mouse genome (Lander et al. 2001
; Waterston et al. 2002
). As shown in Figure 3
, LINEs are preferentially associated with AS, with the L1 type being the major category and ES being the majority of AS. We note a lower occurrence and a lower AS association of LINEs in the mouse, but the explanation is unclear.
LTRs make up ~8.5% of the human genome and ~10% of the mouse genome, whereas DNAs are the least abundant element, making up ~3% in human and ~1% in mouse. Similar to SINEs and LINEs, LTRs and DNAs are both present in AS in human. The majority of the DNAs associated with AS are MERs (Lander et al. 2001
; Waterston et al. 2002
). We could not find any LTRs or DNAs in either CS or AS in mouse. It is unclear whether this represents a true difference between human and mouse, is because of a limited sample size, or is a reflection of the low abundance of DNAs in mouse.
Lack of association between other genomic repeat classes and alternative splicing
Results described thus far support a specific association of transposons with AS, with ES being the predominant exon class. To determine whether this association is a general theme across all classes of repeated elements or is only restricted to transposable elements, we mapped two additional repeat classes, simple repeats and low complexity repeats. These two repeat elements are observed in both CS and AS (Fig. 3
). There appears to be no positional preference of either of these two repeat classes for either AS or CS in either human or mouse. Thus, our findings suggest that transposable elements, but not other types of repeat elements, may play a specific role in the evolution of AS (Kazazian 2004
; Miller and Capy 2004
).
| DISCUSSION |
|---|
|
|
|---|
We have now significantly extended previous findings by showing that many of these features also differ between different modes of AS. For example, different modes of splicing have distinct splice site strengths, distinct alternatively spliced exon length distributions, and distinct frequencies of reading frame maintenance. Furthermore, different modes of splicing are shown to be associated with different distributions and frequencies of ESEs and ESSs, suggesting that different modes of splicing may be differentially regulated. To investigate this critical issue, we identified over- and under-represented hexamers in each AS class compared with CS. Our analysis revealed distinct hexamers associated with distinct modes of splicing. In addition, we find that distinct modes are also associated with different elements of previously predicted ESEs and ESSs, suggesting the potential for mode-specific regulation. It will be interesting to further study these mode-specific hexamers, ESEs and ESSs.
Finally, analysis of the different modes of splicing also illustrates that different modes may have emerged from distinct evolutionary paths. Distinct evolutionary pathways have been previously postulated for ES and mutually exclusive exons (Kondrashov and Koonin 2001
); here we report observations suggesting the same may be true for other modes as well. Throughout our analysis, IN is clearly distinct from other modes of AS; however, differences also exist between ES, AA, and AD. ES are shorter in length and have a higher frequency of reading frame preservation than AA and AD. In addition, only ES show a specific association with transposable elements. This suggests that exonization of transposable sequences is not a general phenomenon of alternative splicing, rather, it is a specific trait associated with the creation of ES. The differences between ES, AA, and AD, in fact, highlight fundamental similarities between CS, AA, and AD. For example, CS and the constitutive portions of AA and AD share similar degrees of reading frame preservation and similar exon-length distributions. These similarities suggest a potential evolutionary mechanism in which the constitutive portions of AA and AD originate as CS, with portions of neighboring introns being later converted into the alternative portions of AA and AD (Kondrashov and Koonin 2003
).
In conclusion, annotation of alternative splicing events in both human and mouse has allowed us to make general distinctions between constitutive and alternative splicing events and to identify specific characteristics associated with different modes of alternative splicing. Future enlargement of the MAASE database will facilitate further refinement of the features and motifs associated with AS. Our findings may serve as a foundation for the development of alternative splicing prediction tools for mammalian genomes. Finally, unique sequence features associated with AS may allow for the elucidation of cellular regulatory programs for alternative splicing when combined with information on the expression of trans-acting factors.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Splice site strength calculation
A log-odds matrix was used to calculate the score of each splice site. Separate log-odds matrices were built for human and mouse using their respective CS splice sites. The nucleotide positions scored for acceptor splice sites spanned from the last 13 nt of the intron to the first nucleotide of the exon. The nucleotide positions scored for donor splice sites spanned from the last 3 nt of the exon to the first 7 nt of the intron. The scoring function is defined as:
![]() |
where F(Xi) is the frequency of finding X at position i in sequence I (each donor or acceptor sequence) and Q(X) is the frequency of X in the corresponding CS splice site.
Orthologous exons
We identified orthologous humanmouse genes using Homolo-Gene (Wheeler et al. 2004
). Between orthologous genes, all human exons were compared with all mouse exons using BLASTZ (Schwartz et al. 2000
). The exon pairs scoring the highest sequence similarity were defined to be orthologous exon pairs. Default parameters were used.
The mapping of previously predicted ESEs and ESSs
Previously predicted ESEs and ESSs from Chris Burges group (RESCUE-ESEs and FAS-ESSs) and Larry Chasins group (PESEs and PESSs) were mapped onto each exon of each exon class. The distribution of elements in an AS class is compared with CS using the Wilcoxon rank sum test.
The identification of potential mode-specific regulatory motifs
Using the RSA tools (van Helden et al. 2000
), all 4096 possible hexamers were counted and compared with a background model consisting of a third-order Markov model computed from the joint set of CS and AS from the same species to obtain a Z-score for each hexamer. Z-scores measure the deviation from expectation in units of standard deviations. Each Z-score was converted to a P-value using the standard normal curve. This conversion allowed over- and under-represented hexamers for each AS class to be identified by comparing with hexamers from CS. For each hexamer, the P-value for each AS mode was subtracted from the P-value for CS (
p = log pCS log pAS). Hexamers with positive
p values are those that are under-presented in an AS class, while a negative
p score are those that are over-represented in an AS class. This
p value for each hexamer was similarly calculated for human and mouse.
Analysis of genomic repeat elements
Human and mouse repeat elements were retrieved from the annotated databases of the UCSC Genome Browser (http://genome.ucsc.edu/index.html). The genomic locations of each repeat class (SINEs, LINEs, LTRs, DNAs, simple repeats, and low complexity repeats) were surveyed for genomic positional overlap with specific CS and AS exon and intron regions surrounding the splice sites. The sequence regions bordering individual splice sites, from 150 to +150, were divided into 25 nucleotide bins. The frequency at which a bin overlapped with a specific repeat element is calculated.
| ACKNOWLEDGMENTS |
|---|
| Footnotes |
|---|
Received March 18, 2005; accepted August 29, 2005.
| REFERENCES |
|---|
|
|
|---|
Baelde, H.J., Eikmans, M., Doran, P.P., Lappin, D.W., de Heer, E., and Bruijn, J.A. 2004a. Gene expression profiling in glomeruli from human kidneys with diabetic nephropathy. Am. J. Kidney Dis. 43: 636650.[CrossRef][Medline]
Baelde, H.J., Eikmans, M., van Vliet, A.I., Bergijk, E.C., de Heer, E., and Bruijn, J.A. 2004b. Alternatively spliced isoforms of fibronectin in immune-mediated glomerulosclerosis: The role of TGFß and IL-4. J. Pathol. 204: 248257.[CrossRef][Medline]
Berget, S.M. 1995. Exon recognition in vertebrate splicing. J. Biol. Chem. 270: 24112414.
Brett, D., Pospisil, H., Valcarcel, J., Reich, J., and Bork, P. 2002. Alternative splicing and genome complexity. Nat. Genet. 30: 2930.[CrossRef][Medline]
Caceres, J.F. and Kornblihtt, A.R. 2002. Alternative splicing: Multiple control mechanisms and involvement in human disease. Trends Genet. 18: 186193.[CrossRef][Medline]
Clark, F. and Thanaraj, T.A. 2002. Categorization and characterization of transcript-confirmed constitutively and alternatively spliced introns and exons from human. Hum. Mol. Genet. 11: 451464.
Croft, L., Schandorff, S., Clark, F., Burrage, K., Arctander, P., and Mattick, J.S. 2000. ISIS, the intron information system, reveals the high frequency of alternative splicing in the human genome. Nat. Genet. 24: 340341.[CrossRef][Medline]
Dredge, B.K., Polydorides, A.D., and Darnell, R.B. 2001. The splice of life: Alternative splicing and neurological disease. Nat. Rev. Neurosci. 2: 4350.[CrossRef][Medline]
Fairbrother, W.G., Yeh, R.F., Sharp, P.A., and Burge, C.B. 2002. Predictive identification of exonic splicing enhancers in human genes. Science 297: 10071013.
Galante, P.A., Sakabe, N.J., Kirschbaum-Slager, N., and de Souza, S.J. 2004. Detection and evaluation of intron retention events in the human transcriptome. RNA 10: 757765.
Garcia-Blanco, M.A., Baraniak, A.P., and Lasda, E.L. 2004. Alternative splicing in disease and therapy. Nat. Biotechnol. 22: 535546.[CrossRef][Medline]
Ibrahim el, C., Schaal, T.D., Hertel, K.J., Reed, R., and Maniatis, T. 2005. Serine/arginine-rich protein-dependent suppression of exon skipping by exonic splicing enhancers. Proc. Natl. Acad. Sci. 102: 50025007.
Itoh, H., Washio, T., and Tomita, M. 2004. Computational comparative analyses of alternative splicing regulation using full-length cDNA of various eukaryotes. RNA 10: 10051018.
Johnson, J.M., Castle, J., Garrett-Engele, P., Kan, Z., Loerch, P.M., Armour, C.D., Santos, R., Schadt, E.E., Stoughton, R., and Shoemaker, D.D. 2003. Genome-wide survey of human alternative pre-mRNA splicing with exon junction microarrays. Science 302: 21412144.
Kan, Z., Castle, J., Johnson, J.M., and Tsinoremas, N.F. 2004. Detection of novel splice forms in human and mouse using cross-species approach. Pac. Symp. Biocomput. 4253.
Kazazian Jr., H.H 2004. Mobile elements: Drivers of genome evolution. Science 303: 16261632.
Kondrashov, F.A. and Koonin, E.V. 2001. Origin of alternative splicing by tandem exon duplication. Hum. Mol. Genet. 10: 26612669.
. 2003. Evolution of alternative splicing: Deletions, insertions and origin of functional parts of proteins from intron sequences. Trends Genet. 19: 115119.[CrossRef][Medline]
Lander, E.S., Linton, L.M., Birren, B. Nusbaum, C., Zody, M.C., Baldwin, J., Devon, K., Dewar, K., Doyle, M., FitzHugh, W., et al. 2001. Initial sequencing and analysis of the human genome. Nature 409: 860921.[CrossRef][Medline]
Lev-Maor, G., Sorek, R., Shomron, N., and Ast, G. 2003. The birth of an alternatively spliced exon: 3' splice site selection in Alu exons. Science 300: 12881291.
Miller, W.J. and Capy, P. 2004. Mobile genetic elements as natural tools for genome evolution. Methods Mol. Biol. 260: 120.[Medline]
Modrek, B. and Lee, C.J. 2003. Alternative splicing in the human, mouse and rat genomes is associated with an increased frequency of exon creation and/or loss. Nat. Genet. 34: 177180.[CrossRef][Medline]
Modrek, B., Resch, A., Grasso, C., and Lee, C. 2001. Genome-wide detection of alternative splicing in expressed sequences of human genes. Nucleic Acids Res. 29: 28502859.
Philipps, D.L., Park, J.W., and Graveley, B.R. 2004. A computational and experimental approach toward a priori identification of alternatively spliced exons. RNA 10: 18381844.
Quentin, Y. 1994. A master sequence related to a free left Alu monomer (FLAM) at the origin of the B1 family in rodent genomes. Nucleic Acids Res. 22: 22222227.
Reed, R. and Maniatis, T. 1986. A role for exon sequences and splice-site proximity in splice-site selection. Cell 46: 681690.[CrossRef][Medline]
Schwartz, S., Zhang, Z., Frazer, K.A., Smit, A., Riemer, C., Bouck, J., Gibbs, R., Hardison, R., and Miller, W. 2000. PipMakera web server for aligning two genomic DNA sequences. Genome Res. 10: 577586.
Senapathy, P., Shapiro, M.B., and Harris, N.L. 1990. Splice junctions, branch point sites, and exons: Sequence statistics, identification, and applications to genome project. Methods Enzymol. 183: 252278.[Medline]
Sorek, R. and Ast, G. 2003. Intronic sequences flanking alternatively spliced exons are conserved between human and mouse. Genome Res. 13: 16311637.
Sorek, R., Ast, G., and Graur, D. 2002. Alu-containing exons are alternatively spliced. Genome Res. 12: 10601067.
Sorek, R., Lev-Maor, G., Reznik, M., Dagan, T., Belinky, F., Graur, D., and Ast, G. 2004a. Minimal conditions for exonization of intronic sequences: 5' splice site formation in alu exons. Mol. Cell. 14: 221231.[CrossRef][Medline]
Sorek, R., Shemesh, R., Cohen, Y., Basechess, O., Ast, G., and Shamir, R. 2004b. A non-EST-based method for exon-skipping prediction. Genome Res. 14: 16171623.
Stamm, S., Zhu, J., Nakai, K., Stoilov, P., Stoss, O., and Zhang, M.Q. 2000. An alternative-exon database and its statistical analysis. DNA Cell. Biol. 19: 739756.[CrossRef][Medline]
Sugnet, C.W., Kent, W.J., Ares M., Jr., and Haussler, D. 2004. Transcriptome and genome conservation of alternative splicing events in humans and mice. Pac. Symp. Biocomput. 6677.
Thanaraj, T.A. and Stamm, S. 2003. Prediction and statistical analysis of alternatively spliced exons. Prog. Mol. Subcell. Biol. 31: 131.[Medline]
van Helden, J., Andre, B., and Collado-Vides, J. 2000. A web site for the computational analysis of yeast regulatory sequences. Yeast 16: 177187.[CrossRef][Medline]
Wang, Z., Rolish, M.E., Yeo, G., Tung, V., Mawson, M., and Burge, C.B. 2004. Systematic identification and analysis of exonic splicing silencers. Cell 119: 831845.[CrossRef][Medline]
Waterston, R.H., Lindblad-Toh, K., Birney, E. Rogers, J., Abril, J.F., Agarwal, P., Agarwala, R., Ainscough, R., Alexandersson, M., An, P., et al. 2002. Initial sequencing and comparative analysis of the mouse genome. Nature 420: 520562.[CrossRef][Medline]
Wheeler, D.L., Church, D.M., Edgar, R. Federhen, S., Helmberg, W., Madden, T.L., Pontius, J.U., Schuler, G.D., Schriml, L.M., Sequeira, E., et al. 2004. Database resources of the National Center for Biotechnology Information: Update. Nucleic Acids Res. 32: D35D40.
Xie, H., Zhu, W.Y., Wasserman, A., Grebinskiy, V., Olson, A., and Mintz, L. 2002. Computational analysis of alternative splicing using EST tissue information. Genomics 80: 326330.[CrossRef][Medline]
Yeo, G., Hoon, S., Venkatesh, B., and Burge, C.B. 2004. Variation in sequence and organization of splicing regulatory elements in vertebrate genes. Proc. Natl. Acad. Sci. 101: 1570015705.
Zavolan, M., Kondo, S., Schonbach, C., Adachi, J., Hume, D.A., Hayashizaki, Y., and Gaasterland, T. 2003. Impact of alternative initiation, splicing, and termination on the diversity of the mRNA transcripts encoded by the mouse transcriptome. Genome Res. 13: 12901300.
Zhang, X.H. and Chasin, L.A. 2004. Computational definition of sequence motifs governing constitutive exon splicing. Genes & Dev. 18: 12411250.
Zheng, C.L., Nair, T.M., Gribskov, M., Kwon, Y.S., Li, H.R., and Fu, X.D. 2004. A database designed to computationally aid an experimental approach to alternative splicing. Pac. Symp. Biocomput. 7888.
Zheng, C.L, Kwon, Y.-S., Li, H.-R., Zhang, K., Coutinho-Mansfield, G., Yang, C., Nair T.M., Gribskov, M., and Fu, X.-D. 2005. MAASE: An alternative splicing database designed for supporting splicing microarray applications. RNA (this issue).
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
I. Aznarez, Y. Barash, O. Shai, D. He, J. Zielenski, L.-C. Tsui, J. Parkinson, B. J. Frey, J. M. Rommens, and B. J. Blencowe A systematic analysis of intronic sequences downstream of 5' splice sites reveals a widespread role for U-rich motifs and TIA1/TIAL1 proteins in alternative splicing regulation Genome Res., August 1, 2008; 18(8): 1247 - 1258. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Goren, E. Kim, M. Amit, R. Bochner, G. Lev-Maor, N. Ahituv, and G. Ast Alternative approach to a heavy weight problem Genome Res., February 1, 2008; 18(2): 214 - 220. [Abstract] [Full Text] [PDF] |