Bioinformatic
analysis of exon repetition, exon scrambling and
trans-splicing in humans
Xiang Shao, Valery Shepelev, Alexei Fedorov
BIOINFORMATICS-2005-1363 ABSTRACT
Motivation: Using bioinformatic
approaches we aimed to characterize poorly-understood
abnormalities in splicing known as exon scrambling,
exon repetition, and trans-splicing.
Results: We developed
a software package that allows large-scale comparison
of all human EST sequences to the entire set of
human gene sequences. Among 5,992,495 Expressed
Sequence Tag (EST) sequences, 401 cases of exon
repetition and 416 cases of exon scrambling were
found. The vast majority of identified ESTs contains
fragments rather than full-length repeated or
scrambled exons. Their structures suggest that
the scrambled or repeated exon fragments may have
arisen in the process of cDNA cloning and not
from splicing abnormalities. Nevertheless, we
found 11 cases of full-length exon repetition
showing that this phenomenon is real yet very
rare. In searching for examples of trans-splicing,
we looked only at reproducible events where at
least two independent ESTs represent the same
putative trans-splicing event. We found 15 ESTs
representing five types of putative trans-splicing.
However, all 15 cases were derived from human
malignant tissues and could have resulted from
genomic rearrangements. Our results provide support
for a very rare but physiological occurrence of
exon repetition, but suggest that apparent exon
scrambling and trans-splicing result respectively
from in vitro artifact and gene-level abnormalities.
PROGRAMS:
ESR1.pl
ESR2.pl
ESR3.pl
ESR4.pl
ESR5.pl
ESR6.pl
TRANSSPLICING1.pl
TRANSSPLICING2.pl
SUPPLEMENTARY FILE:
ES_ER_TS.doc
QUESTIONS about software to Xiang
Shao
|