Eukaryotic mRNAs encoding abundant and scarce proteins are statistically dissimilar in many structural features

FEBS Lett. 1998 Dec 4;440(3):351-5. doi: 10.1016/s0014-5793(98)01482-3.

Abstract

It is well known that non-coding mRNA sequences are dissimilar in many structural features. For individual mRNAs correlations were found for some of these features and their translational efficiency. However, no systematic statistical analysis was undertaken to relate protein abundance and structural characteristics of mRNA encoding the given protein. We have demonstrated that structural and contextual features of eukaryotic mRNAs encoding high- and low-abundant proteins differ in the 5' untranslated regions (UTR). Statistically, 5' UTRs of low-expression mRNAs are longer, their guanine plus cytosine content is higher, they have a less optimal context of the translation initiation codons of the main open reading frames and contain more frequently upstream AUG than 5' UTRs of high-expression mRNAs. Apart from the differences in 5' UTRs, high-expression mRNAs contain stronger termination signals. Structural features of low- and high-expression mRNAs are likely to contribute to the yield of their protein products.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 5' Untranslated Regions / chemistry
  • Base Composition
  • Codon, Initiator
  • Codon, Terminator
  • Databases, Factual
  • Electronic Data Processing
  • Eukaryotic Cells
  • Mathematical Computing
  • Nucleic Acid Conformation
  • Proteins / genetics*
  • RNA, Messenger / chemistry*
  • Software

Substances

  • 5' Untranslated Regions
  • Codon, Initiator
  • Codon, Terminator
  • Proteins
  • RNA, Messenger