In language, transcription is the process of matching the sounds of human speech to written symbols using a set of standard rules, so that these sounds can be reproduced later. Usually these rules are organized on a phonetic basis and are specifically constructed in order to be maximally simple. Standard transcription schemes include the International Phonetic Alphabet (IPA), and its ASCII equivalent, SAMPA. One can see numerous examples of transcription on the Common phrases in different languages page (in this particular case, using the standard English spelling rules).
A (simple) model for a bacterial gene to be transcribed looks like this :
upstream ~17 bp The gene to transcribe downstream 5'----------|-35|---------|-10|----------------------|T|------------3' 3'----------|-35|---------|-10|----------------------|T|------------5' | |---------------------> mRNA
- The RNAP recognizes the promoter region of the gene and binds to the DNA at that specific location. At this stage, the DNA is still double-stranded and called closed complex.
- The DNA is unwound and becomes single-stranded at the initiation site (the -10 promoter region). This is called open complex.
- The DNA is melted (the strands are locally separated), the σ-factor leaves the holoenzyme, and the transcription process begins. This is the elongation phase.
The RNAP runs along the DNA, synthesizing mRNA in the process. In bacteria, the nascending mRNA is processed right away by ribosomes.
The elongation stops if:
- The terminator is reached. The terminator is usually a palindromic DNA sequence that forms a hairpin.
- A ρ factor (a protein) binds and runs along the mRNA towards the RNAP. When ρ-factor reaches the RNAP, it causes RNAP to dissociate from the DNA, terminating transcription.
- The RNAP comes across a region with repetitious base pairs (for example, TTTTTT). This will terminate transcription.
Gene expression in eukaryotes is largely controlled by transcription via transcription factors. As eukaryotes are much more complex than prokaryotes, and have their genetic material stored in the nucleus, the transcription mechanisms are more complicated here. For example, eukaryotes have three RNA polymerases, in contrast to prokaryotes, which only have one.
- RNA Polymerase I is located in the nucleolus and transcribes only rRNAs.
- RNA Polymerase II is the "standard" RNAP.
- RNA Polymerase III transcribes tRNAs and other small RNAs.
Also, eukrayotic RNAPs need specific accessory proteins to become active. The C-terminus of all RNAPs is highly conserved and contains the actual transctiptional mechanism.
The core promoter of eukaryotic genes stretches from position -45 to 0. Additionally, there can be an upstream control element present at the -180 to -107 region, which can amplify the RNAP binding by a factor of up to 100. This UCE usually contains a TATA box, a highly conserved DNA sequence that reads
T A T A T/A A
A similar sequence, thus not that highly conserved, is found in the INR element (initiator element, part of the complex core promoter).
A major difference between prokaryotic and eukaryotic transcription is that the latter have splicing of the primary transcript, modifying the mRNA created during transcription.
- See also : signal transduction