The coding regions (exons; form expressed sequences) of most eukaryotic genes are interrupted by few to several non coding sequences called introns from intervening sequences such genes are called split genes or interrupted genes since their coding sequences are split into several parts due to the introns.
But some genes of eukaryotes are not split, e.g. histone genes of sea urchin and four treat shock genes of Drosophila. On the other hand, split genes have been found in an archaebacterium and a phage of E. coli.
The first split gene to be described in 1977 by Chambdon and coworkers was the ovalbumin gene of chicken coding for the 386 amino acid long ovalbumin protein eggs.
ADVERTISEMENTS:
The discovery of split genes was made in 1977 by several groups in a variety of materials, which included the (i) Two research groups separately headed by Phillip A. Sharp and Richard J. Roberts studied genes of adenovirus 2.
(ii) Research groups of D.S. Hogness, I.B. David and N. Davidson studied genes for 28S rRNA in Drosophila.
ADVERTISEMENTS:
(iii) Research groups of P. Chambon, P. Leder and R. A. Flavell studied 6 globin genes, ovalbumin genes and tRNA genes.
In all these cases the genes were found to be interrupted by intervening sequences. The credit for discovery of split genes, however, goes to Phillip Sharp and Richard Roberts, who won the 1993 Nobel Prize for medicine for their work on split genes.
They analyzed the hybrids of late mRNA of adenovirus 2 with the adenovirus genomic DNA. When these mRNA-DNA hybrids were examined under electron microscope, the adjoining sequences of mRNA were found to hybridize with discontinuous stretches of genomic DNA of adenovirus 2.
The intervening DNA sequences were observed as loops and the phenomenon was later described as R looping.
ADVERTISEMENTS:
Chambon’s group compiled sequences of the boundaries of introns from a large number of protein coding eukaryotic genes (not ribosomal RNA or tRNA genes), which revealed the presence of consensus sequences at the intron-exon junctions.
These GT was always found at the 5′ side of the intron (left splice junction) and AG at the 3′ side (right splice junction). This became popularly known as GT-AG or Chambon’s rule.
Some of the diseases (e.g. thalassemias) are caused by mutations, which created or abolished these splice junctions.
Phillip Sharp’s group and other groups later also conducted studies to elucidate the mechanisms of splicing.