Intron size, abundance, and distribution within untranslated regions of genes

Xin Hong, Douglas G. Scofield, Michael Lynch

Research output: Contribution to journal › Article

101 Citations (Scopus)

Abstract

Most research concerning the evolution of introns has largely considered introns within coding sequences (CDSs), without regard for introns located within untranslated regions (UTRs) of genes. Here, we directly determined intron size, abundance, and distribution in UTRs of genes using full-length cDNA libraries and complete genome sequences for four species, Arabidopsis thaliana, Drosophila melanogaster, human, and mouse. Overall intron occupancy (introns/exon kbp) is lower in 5′ UTRs than CDSs, but intron density (intron occupancy in regions containing introns) tends to be higher in 5′ UTRs than in CDSs. Introns in 5′ UTRs are roughly twice as large as introns in CDSs, and there is a sharp drop in intron size at the 5′ UTR-CDS boundary. We propose a mechanistic explanation for the existence of selection for larger intron size in 5′ UTRs, and outline several implications of this hypothesis. We found introns to be randomly distributed within 5′ UTRs, so long as a minimum required exon size was assumed. Introns in 3′ UTRs were much less abundant than in 5′ UTRs. Though this was expected for human and mouse that have intron-dependent nonsense-mediated decay (NMD) pathways that discourage the presence of introns within the 3′ UTR, it was also true for A. thaliana and D. melanogaster, which may lack intron-dependent NMD. Our findings have several implications for theories of intron evolution and genome evolution in general.
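The two measures defined in the abstract can be illustrated with a minimal sketch. Intron occupancy is introns per exon kbp over all regions, and intron density is the same ratio restricted to regions that contain at least one intron. The `Region` class, the function names, the pooled-ratio calculation, and the toy numbers below are all assumptions made for illustration; they are not the authors' pipeline or data.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Region:
    """One annotated region (e.g. a single gene's 5' UTR). Hypothetical type."""
    exon_bp: int    # total exonic length of the region, in bp
    n_introns: int  # number of introns interrupting the region


def introns_per_exon_kbp(regions: Iterable[Region]) -> float:
    """Pooled ratio: total introns divided by total exon length in kbp."""
    regions = list(regions)
    total_bp = sum(r.exon_bp for r in regions)
    if total_bp == 0:
        return 0.0
    return 1000.0 * sum(r.n_introns for r in regions) / total_bp


def intron_occupancy(regions: List[Region]) -> float:
    """Introns per exon kbp over all regions, with or without introns."""
    return introns_per_exon_kbp(regions)


def intron_density(regions: List[Region]) -> float:
    """Introns per exon kbp, restricted to regions containing introns."""
    return introns_per_exon_kbp(r for r in regions if r.n_introns > 0)


# Toy data (invented): three short UTRs, only one of which has an intron.
toy_utrs = [Region(200, 1), Region(300, 0), Region(500, 0)]
print(intron_occupancy(toy_utrs))  # 1 intron over 1000 bp of exon
print(intron_density(toy_utrs))    # 1 intron over 200 bp of exon
```

With these toy values, occupancy is low because most regions lack introns, while density is higher because it is computed only over the intron-containing region. This is the kind of contrast the abstract reports between 5′ UTRs and CDSs.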

Original language: English (US)
Pages (from-to): 2392-2404
Number of pages: 13
Journal: Molecular Biology and Evolution
Volume: 23
Issue number: 12
DOI: 10.1093/molbev/msl111
State: Published - Dec 1 2006
Externally published: Yes

Fingerprint

Untranslated Regions
Introns
Genes
5' Untranslated Regions
Genome
sequence boundary
distribution
3' Untranslated Regions
Drosophila melanogaster
Arabidopsis
Exons
Arabidopsis thaliana
deterioration

Keywords

  • Genome evolution
  • Intron
  • Untranslated region

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Molecular Biology
  • Genetics

Cite this

Intron size, abundance, and distribution within untranslated regions of genes. / Hong, Xin; Scofield, Douglas G.; Lynch, Michael.

In: Molecular Biology and Evolution, Vol. 23, No. 12, 01.12.2006, p. 2392-2404.

@article{0ea0c16532aa4a50aa6525bd905c1865,
title = "Intron size, abundance, and distribution within untranslated regions of genes",
abstract = "Most research concerning the evolution of introns has largely considered introns within coding sequences (CDSs), without regard for introns located within untranslated regions (UTRs) of genes. Here, we directly determined intron size, abundance, and distribution in UTRs of genes using full-length cDNA libraries and complete genome sequences for four species, Arabidopsis thaliana, Drosophila melanogaster, human, and mouse. Overall intron occupancy (introns/exon kbp) is lower in 5′ UTRs than CDSs, but intron density (intron occupancy in regions containing introns) tends to be higher in 5′ UTRs than in CDSs. Introns in 5′ UTRs are roughly twice as large as introns in CDSs, and there is a sharp drop in intron size at the 5′ UTR-CDS boundary. We propose a mechanistic explanation for the existence of selection for larger intron size in 5′ UTRs, and outline several implications of this hypothesis. We found introns to be randomly distributed within 5′ UTRs, so long as a minimum required exon size was assumed. Introns in 3′ UTRs were much less abundant than in 5′ UTRs. Though this was expected for human and mouse that have intron-dependent nonsense-mediated decay (NMD) pathways that discourage the presence of introns within the 3′ UTR, it was also true for A. thaliana and D. melanogaster, which may lack intron-dependent NMD. Our findings have several implications for theories of intron evolution and genome evolution in general.",
keywords = "Genome evolution, Intron, Untranslated region",
author = "Hong, Xin and Scofield, Douglas G. and Lynch, Michael",
year = "2006",
month = "12",
day = "1",
doi = "10.1093/molbev/msl111",
language = "English (US)",
volume = "23",
pages = "2392--2404",
journal = "Molecular Biology and Evolution",
issn = "0737-4038",
publisher = "Oxford University Press",
number = "12",
}

TY - JOUR
T1 - Intron size, abundance, and distribution within untranslated regions of genes
AU - Hong, Xin
AU - Scofield, Douglas G.
AU - Lynch, Michael
PY - 2006/12/1
Y1 - 2006/12/1
AB - Most research concerning the evolution of introns has largely considered introns within coding sequences (CDSs), without regard for introns located within untranslated regions (UTRs) of genes. Here, we directly determined intron size, abundance, and distribution in UTRs of genes using full-length cDNA libraries and complete genome sequences for four species, Arabidopsis thaliana, Drosophila melanogaster, human, and mouse. Overall intron occupancy (introns/exon kbp) is lower in 5′ UTRs than CDSs, but intron density (intron occupancy in regions containing introns) tends to be higher in 5′ UTRs than in CDSs. Introns in 5′ UTRs are roughly twice as large as introns in CDSs, and there is a sharp drop in intron size at the 5′ UTR-CDS boundary. We propose a mechanistic explanation for the existence of selection for larger intron size in 5′ UTRs, and outline several implications of this hypothesis. We found introns to be randomly distributed within 5′ UTRs, so long as a minimum required exon size was assumed. Introns in 3′ UTRs were much less abundant than in 5′ UTRs. Though this was expected for human and mouse that have intron-dependent nonsense-mediated decay (NMD) pathways that discourage the presence of introns within the 3′ UTR, it was also true for A. thaliana and D. melanogaster, which may lack intron-dependent NMD. Our findings have several implications for theories of intron evolution and genome evolution in general.
KW - Genome evolution
KW - Intron
KW - Untranslated region
UR - http://www.scopus.com/inward/record.url?scp=33750962013&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33750962013&partnerID=8YFLogxK
U2 - 10.1093/molbev/msl111
DO - 10.1093/molbev/msl111
M3 - Article
C2 - 16980575
AN - SCOPUS:33750962013
VL - 23
SP - 2392
EP - 2404
JO - Molecular Biology and Evolution
JF - Molecular Biology and Evolution
SN - 0737-4038
IS - 12
ER -