A codon table can be used to translate a genetic code into a sequence of amino acids.[1] The standard genetic code is traditionally represented as an RNA codon table, because when proteins are made in a cell by ribosomes, it is messenger RNA (mRNA) that directs protein synthesis.[2] [3] The mRNA sequence is determined by the sequence of genomic DNA.[4] In this context, the standard genetic code is referred to as translation table 1.[3] It can also be represented in a DNA codon table. The DNA codons in such tables occur on the sense DNA strand and are arranged in the 5′-to-3′ direction. Different tables with alternative codons are used depending on the source of the genetic code, such as a cell nucleus, mitochondrion, plastid, or hydrogenosome.[5]
There are 64 different codons in the genetic code and in the tables below; most specify an amino acid.[6] Three sequences, UAG, UGA, and UAA, known as stop codons, do not code for an amino acid but instead signal the release of the nascent polypeptide from the ribosome.[7] In the standard code, the sequence AUG, read as methionine, can serve as a start codon and, together with nearby sequence elements and initiation factors, initiates translation.[3] [8] [9] In rare instances, start codons in the standard code may also include GUG or UUG; these codons normally represent valine and leucine, respectively, but as start codons they are translated as methionine or formylmethionine.[3] [9]
The classical table/wheel of the standard genetic code is arbitrarily organized based on codon position 1. Saier,[10] following earlier observations,[11] showed that reorganizing the wheel by codon position 2 (and reordering the bases from UCAG to UCGA) better arranges the codons by the hydrophobicity of their encoded amino acids. This suggests that early ribosomes read the second codon position most carefully, to control hydrophobicity patterns in protein sequences.
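The effect of grouping by codon position 2 can be illustrated with a small Python sketch. It is not the analysis from the cited work: it simply groups the 64 standard codons by their second base and uses the nonpolar (np) labels from the standard table in this article as a crude stand-in for hydrophobicity. The names `AMINO_ACIDS`, `NONPOLAR`, and `by_second` are hypothetical.

```python
from itertools import product

# Bases in UCAG order; the string gives the one-letter amino acid (or * for
# stop) for UUU, UUC, UUA, UUG, UCU, ... in the standard code.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Assumption of this sketch: amino acids labelled (np) in the standard table
# are treated as "hydrophobic"; this is only a rough proxy.
NONPOLAR = set("FLIMVPAWG")

# Collect the set of encoded amino acids for each second-position base.
by_second = {base: set() for base in BASES}
for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS):
    by_second[codon[1]].add(aa)

for base in BASES:
    group = by_second[base]
    nonpolar = sorted(group & NONPOLAR)
    other = sorted(group - NONPOLAR)
    print(f"2nd base {base}: nonpolar={nonpolar} other={other}")
```

With these labels, every codon with U in position 2 encodes a nonpolar amino acid, and no codon with A in position 2 does, which is consistent with the pattern described above.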
The first table—the standard table—can be used to translate nucleotide triplets into the corresponding amino acid or appropriate signal if it is a start or stop codon. The second table, appropriately called the inverse, does the opposite: it can be used to deduce a possible triplet code if the amino acid is known. As multiple codons can code for the same amino acid, the International Union of Pure and Applied Chemistry's (IUPAC) nucleic acid notation is given in some instances.
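As a minimal sketch of how the two tables are used, the following Python builds the standard (forward) table from a compact 64-letter string in UCAG order and derives the inverse table from it. The names `STANDARD_TABLE`, `INVERSE_TABLE`, and `translate` are hypothetical, and the function ignores start-codon selection and reading-frame detection.

```python
from itertools import product

# Bases in UCAG order; the string gives the one-letter amino acid (or * for
# stop) for UUU, UUC, UUA, UUG, UCU, ... in the standard code.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Forward table: codon -> amino acid.
STANDARD_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Inverse table: amino acid -> list of codons that encode it.
INVERSE_TABLE = {}
for codon, aa in STANDARD_TABLE.items():
    INVERSE_TABLE.setdefault(aa, []).append(codon)


def translate(mrna):
    """Translate an mRNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        aa = STANDARD_TABLE[mrna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("AUGGCCUGGUAA"))  # codons: AUG GCC UGG UAA -> MAW
print(INVERSE_TABLE["I"])         # ['AUU', 'AUC', 'AUA']
```

The inverse lookup returns several codons for most amino acids, which is why the inverse table also gives a compressed form.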
Amino-acid biochemical properties | Nonpolar (np) | Polar (p) | Basic (b) | Acidic (a) | Termination: stop codon * | Initiation: possible start codon ⇒ |
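1st base | Codon (2nd base U) | Amino acid | Codon (2nd base C) | Amino acid | Codon (2nd base A) | Amino acid | Codon (2nd base G) | Amino acid | 3rd base |
---|---|---|---|---|---|---|---|---|---|
U | UUU | (Phe/F) Phenylalanine (np) | UCU | (Ser/S) Serine (p) | UAU | (Tyr/Y) Tyrosine (p) | UGU | (Cys/C) Cysteine (p) | U |
U | UUC | (Phe/F) Phenylalanine (np) | UCC | (Ser/S) Serine (p) | UAC | (Tyr/Y) Tyrosine (p) | UGC | (Cys/C) Cysteine (p) | C |
U | UUA | (Leu/L) Leucine (np) | UCA | (Ser/S) Serine (p) | UAA | Stop (Ochre) * | UGA | Stop (Opal) * | A |
U | UUG ⇒ | (Leu/L) Leucine (np) | UCG | (Ser/S) Serine (p) | UAG | Stop (Amber) * | UGG | (Trp/W) Tryptophan (np) | G |
C | CUU | (Leu/L) Leucine (np) | CCU | (Pro/P) Proline (np) | CAU | (His/H) Histidine (b) | CGU | (Arg/R) Arginine (b) | U |
C | CUC | (Leu/L) Leucine (np) | CCC | (Pro/P) Proline (np) | CAC | (His/H) Histidine (b) | CGC | (Arg/R) Arginine (b) | C |
C | CUA | (Leu/L) Leucine (np) | CCA | (Pro/P) Proline (np) | CAA | (Gln/Q) Glutamine (p) | CGA | (Arg/R) Arginine (b) | A |
C | CUG | (Leu/L) Leucine (np) | CCG | (Pro/P) Proline (np) | CAG | (Gln/Q) Glutamine (p) | CGG | (Arg/R) Arginine (b) | G |
A | AUU | (Ile/I) Isoleucine (np) | ACU | (Thr/T) Threonine (p) | AAU | (Asn/N) Asparagine (p) | AGU | (Ser/S) Serine (p) | U |
A | AUC | (Ile/I) Isoleucine (np) | ACC | (Thr/T) Threonine (p) | AAC | (Asn/N) Asparagine (p) | AGC | (Ser/S) Serine (p) | C |
A | AUA | (Ile/I) Isoleucine (np) | ACA | (Thr/T) Threonine (p) | AAA | (Lys/K) Lysine (b) | AGA | (Arg/R) Arginine (b) | A |
A | AUG ⇒ | (Met/M) Methionine (np) | ACG | (Thr/T) Threonine (p) | AAG | (Lys/K) Lysine (b) | AGG | (Arg/R) Arginine (b) | G |
G | GUU | (Val/V) Valine (np) | GCU | (Ala/A) Alanine (np) | GAU | (Asp/D) Aspartic acid (a) | GGU | (Gly/G) Glycine (np) | U |
G | GUC | (Val/V) Valine (np) | GCC | (Ala/A) Alanine (np) | GAC | (Asp/D) Aspartic acid (a) | GGC | (Gly/G) Glycine (np) | C |
G | GUA | (Val/V) Valine (np) | GCA | (Ala/A) Alanine (np) | GAA | (Glu/E) Glutamic acid (a) | GGA | (Gly/G) Glycine (np) | A |
G | GUG ⇒ | (Val/V) Valine (np) | GCG | (Ala/A) Alanine (np) | GAG | (Glu/E) Glutamic acid (a) | GGG | (Gly/G) Glycine (np) | G |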
As shown in the above table, NCBI table 1 includes the less-canonical start codons GUG and UUG.[3]
Amino acid | RNA codons | Compressed | Amino acid | RNA codons | Compressed | |
---|---|---|---|---|---|---|
Ala, A | GCU, GCC, GCA, GCG | GCN | Ile, I | AUU, AUC, AUA | AUH | |
Arg, R | CGU, CGC, CGA, CGG; AGA, AGG | CGN, AGR; or CGY, MGR | Leu, L | CUU, CUC, CUA, CUG; UUA, UUG | CUN, UUR; or CUY, YUR | |
Asn, N | AAU, AAC | AAY | Lys, K | AAA, AAG | AAR | |
Asp, D | GAU, GAC | GAY | Met, M | AUG | ||
Asn or Asp, B | AAU, AAC; GAU, GAC | RAY | Phe, F | UUU, UUC | UUY | |
Cys, C | UGU, UGC | UGY | Pro, P | CCU, CCC, CCA, CCG | CCN | |
Gln, Q | CAA, CAG | CAR | Ser, S | UCU, UCC, UCA, UCG; AGU, AGC | UCN, AGY | |
Glu, E | GAA, GAG | GAR | Thr, T | ACU, ACC, ACA, ACG | ACN | |
Gln or Glu, Z | CAA, CAG; GAA, GAG | SAR | Trp, W | UGG | ||
Gly, G | GGU, GGC, GGA, GGG | GGN | Tyr, Y | UAU, UAC | UAY | |
His, H | CAU, CAC | CAY | Val, V | GUU, GUC, GUA, GUG | GUN | |
START | AUG, CUG, UUG | HUG | STOP | UAA, UGA, UAG | URA, UAG; or UGA, UAR |
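The compressed column uses IUPAC nucleic acid notation, in which one letter stands for a set of bases (for example, N for any base and R for a purine). The following Python sketch expands a compressed codon into the explicit codons it covers; the names `IUPAC` and `expand` are hypothetical and the mapping is written for the RNA alphabet.

```python
from itertools import product

# IUPAC nucleotide codes (RNA alphabet): each symbol stands for a set of bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "U": "U",
    "R": "AG", "Y": "CU", "S": "CG", "W": "AU", "K": "GU", "M": "AC",
    "B": "CGU", "D": "AGU", "H": "ACU", "V": "ACG", "N": "ACGU",
}


def expand(compressed):
    """Return every explicit codon matched by a compressed (degenerate) codon."""
    choices = (IUPAC[symbol] for symbol in compressed)
    return ["".join(codon) for codon in product(*choices)]


print(expand("AUH"))  # ['AUA', 'AUC', 'AUU'] (isoleucine)
print(expand("MGR"))  # ['AGA', 'AGG', 'CGA', 'CGG'] (four of the arginine codons)
```

Expanding both parts of a two-part entry, such as CGY and MGR for arginine, recovers the full list of codons in the corresponding row.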
Amino-acid biochemical properties | Nonpolar (np) | Polar (p) | Basic (b) | Acidic (a) | Termination: stop codon * | Initiation: possible start codon ⇒ |
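1st base | Codon (2nd base T) | Amino acid | Codon (2nd base C) | Amino acid | Codon (2nd base A) | Amino acid | Codon (2nd base G) | Amino acid | 3rd base |
---|---|---|---|---|---|---|---|---|---|
T | TTT | (Phe/F) Phenylalanine (np) | TCT | (Ser/S) Serine (p) | TAT | (Tyr/Y) Tyrosine (p) | TGT | (Cys/C) Cysteine (p) | T |
T | TTC | (Phe/F) Phenylalanine (np) | TCC | (Ser/S) Serine (p) | TAC | (Tyr/Y) Tyrosine (p) | TGC | (Cys/C) Cysteine (p) | C |
T | TTA | (Leu/L) Leucine (np) | TCA | (Ser/S) Serine (p) | TAA | Stop (Ochre) * | TGA | Stop (Opal) * | A |
T | TTG ⇒ | (Leu/L) Leucine (np) | TCG | (Ser/S) Serine (p) | TAG | Stop (Amber) * | TGG | (Trp/W) Tryptophan (np) | G |
C | CTT | (Leu/L) Leucine (np) | CCT | (Pro/P) Proline (np) | CAT | (His/H) Histidine (b) | CGT | (Arg/R) Arginine (b) | T |
C | CTC | (Leu/L) Leucine (np) | CCC | (Pro/P) Proline (np) | CAC | (His/H) Histidine (b) | CGC | (Arg/R) Arginine (b) | C |
C | CTA | (Leu/L) Leucine (np) | CCA | (Pro/P) Proline (np) | CAA | (Gln/Q) Glutamine (p) | CGA | (Arg/R) Arginine (b) | A |
C | CTG | (Leu/L) Leucine (np) | CCG | (Pro/P) Proline (np) | CAG | (Gln/Q) Glutamine (p) | CGG | (Arg/R) Arginine (b) | G |
A | ATT | (Ile/I) Isoleucine (np) | ACT | (Thr/T) Threonine (p) | AAT | (Asn/N) Asparagine (p) | AGT | (Ser/S) Serine (p) | T |
A | ATC | (Ile/I) Isoleucine (np) | ACC | (Thr/T) Threonine (p) | AAC | (Asn/N) Asparagine (p) | AGC | (Ser/S) Serine (p) | C |
A | ATA | (Ile/I) Isoleucine (np) | ACA | (Thr/T) Threonine (p) | AAA | (Lys/K) Lysine (b) | AGA | (Arg/R) Arginine (b) | A |
A | ATG ⇒ | (Met/M) Methionine (np) | ACG | (Thr/T) Threonine (p) | AAG | (Lys/K) Lysine (b) | AGG | (Arg/R) Arginine (b) | G |
G | GTT | (Val/V) Valine (np) | GCT | (Ala/A) Alanine (np) | GAT | (Asp/D) Aspartic acid (a) | GGT | (Gly/G) Glycine (np) | T |
G | GTC | (Val/V) Valine (np) | GCC | (Ala/A) Alanine (np) | GAC | (Asp/D) Aspartic acid (a) | GGC | (Gly/G) Glycine (np) | C |
G | GTA | (Val/V) Valine (np) | GCA | (Ala/A) Alanine (np) | GAA | (Glu/E) Glutamic acid (a) | GGA | (Gly/G) Glycine (np) | A |
G | GTG ⇒ | (Val/V) Valine (np) | GCG | (Ala/A) Alanine (np) | GAG | (Glu/E) Glutamic acid (a) | GGG | (Gly/G) Glycine (np) | G |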
Amino acid | DNA codons | Compressed | Amino acid | DNA codons | Compressed | |
---|---|---|---|---|---|---|
Ala, A | GCT, GCC, GCA, GCG | GCN | Ile, I | ATT, ATC, ATA | ATH | |
Arg, R | CGT, CGC, CGA, CGG; AGA, AGG | CGN, AGR; or CGY, MGR | Leu, L | CTT, CTC, CTA, CTG; TTA, TTG | CTN, TTR; or CTY, YTR | |
Asn, N | AAT, AAC | AAY | Lys, K | AAA, AAG | AAR | |
Asp, D | GAT, GAC | GAY | Met, M | ATG | ||
Asn or Asp, B | AAT, AAC; GAT, GAC | RAY | Phe, F | TTT, TTC | TTY | |
Cys, C | TGT, TGC | TGY | Pro, P | CCT, CCC, CCA, CCG | CCN | |
Gln, Q | CAA, CAG | CAR | Ser, S | TCT, TCC, TCA, TCG; AGT, AGC | TCN, AGY | |
Glu, E | GAA, GAG | GAR | Thr, T | ACT, ACC, ACA, ACG | ACN | |
Gln or Glu, Z | CAA, CAG; GAA, GAG | SAR | Trp, W | TGG | ||
Gly, G | GGT, GGC, GGA, GGG | GGN | Tyr, Y | TAT, TAC | TAY | |
His, H | CAT, CAC | CAY | Val, V | GTT, GTC, GTA, GTG | GTN | |
START | ATG, TTG, GTG, CTG[13] | NTG | STOP | TAA, TGA, TAG | TRA, TAR |
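Because the DNA codons are written on the sense strand in the 5′-to-3′ direction, a DNA codon corresponds to its RNA codon with T in place of U. The Python sketch below shows that conversion, plus the reverse-complement step needed when only the template (antisense) strand is available. The function names are hypothetical, and the sketch assumes uppercase or lowercase A, C, G, T input with no ambiguity codes.

```python
# Translation map for complementing DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def sense_dna_to_mrna(sense):
    """Convert a sense-strand DNA sequence (5'->3') to the matching mRNA."""
    return sense.upper().replace("T", "U")


def template_to_sense(template):
    """Reverse-complement a template strand (written 5'->3') to get the sense strand."""
    return template.upper().translate(COMPLEMENT)[::-1]


print(sense_dna_to_mrna("ATGTTGTAA"))  # AUGUUGUAA
print(template_to_sense("TTACAACAT"))  # ATGTTGTAA
```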
The genetic code was once believed to be universal: a codon would code for the same amino acid regardless of the organism or source. However, it is now agreed that the genetic code evolves, resulting in discrepancies in how a codon is translated depending on the genetic source.[14] [15] For example, in 1981, it was discovered that the use of codons AUA, UGA, AGA and AGG by the coding system in mammalian mitochondria differed from the universal code.[14] Stop codons can also be affected: in ciliated protozoa, the universal stop codons UAA and UAG code for glutamine.[15] Four novel alternative genetic codes (numbered here 34–37) were discovered in bacterial genomes by Shulgina and Eddy, revealing the first sense codon changes in bacteria.[16] The following table displays these alternative codons.
Amino-acid biochemical properties | Nonpolar (np) | Polar (p) | Basic (b) | Acidic (a) | Termination: stop codon * |
Translation table | Number | DNA codon involved | RNA codon involved | Translation with this code | Standard translation | Notes |
---|---|---|---|---|---|---|
Standard | 1 | | | | | Includes translation table 8 (plant chloroplasts). |
Vertebrate mitochondrial | 2 | AGA | AGA | Stop * | Arg (R) (b) | |
Vertebrate mitochondrial | 2 | AGG | AGG | Stop * | Arg (R) (b) | |
Vertebrate mitochondrial | 2 | ATA | AUA | Met (M) (np) | Ile (I) (np) | |
Vertebrate mitochondrial | 2 | TGA | UGA | Trp (W) (np) | Stop * | |
Yeast mitochondrial | 3 | ATA | AUA | Met (M) (np) | Ile (I) (np) | |
Yeast mitochondrial | 3 | CTT | CUU | Thr (T) (p) | Leu (L) (np) | |
Yeast mitochondrial | 3 | CTC | CUC | Thr (T) (p) | Leu (L) (np) | |
Yeast mitochondrial | 3 | CTA | CUA | Thr (T) (p) | Leu (L) (np) | |
Yeast mitochondrial | 3 | CTG | CUG | Thr (T) (p) | Leu (L) (np) | |
Yeast mitochondrial | 3 | TGA | UGA | Trp (W) (np) | Stop * | |
Yeast mitochondrial | 3 | CGA | CGA | absent | Arg (R) (b) | |
Yeast mitochondrial | 3 | CGC | CGC | absent | Arg (R) (b) | |
Mold, protozoan, and coelenterate mitochondrial + Mycoplasma / Spiroplasma | 4 | TGA | UGA | Trp (W) (np) | Stop * | Includes translation table 7 (kinetoplasts). |
Invertebrate mitochondrial | 5 | AGA | AGA | Ser (S) (p) | Arg (R) (b) | |
Invertebrate mitochondrial | 5 | AGG | AGG | Ser (S) (p) | Arg (R) (b) | |
Invertebrate mitochondrial | 5 | ATA | AUA | Met (M) (np) | Ile (I) (np) | |
Invertebrate mitochondrial | 5 | TGA | UGA | Trp (W) (np) | Stop * | |
Ciliate, dasycladacean and Hexamita nuclear | 6 | TAA | UAA | Gln (Q) (p) | Stop * | |
Ciliate, dasycladacean and Hexamita nuclear | 6 | TAG | UAG | Gln (Q) (p) | Stop * | |
Echinoderm and flatworm mitochondrial | 9 | AAA | AAA | Asn (N) (p) | Lys (K) (b) | |
Echinoderm and flatworm mitochondrial | 9 | AGA | AGA | Ser (S) (p) | Arg (R) (b) | |
Echinoderm and flatworm mitochondrial | 9 | AGG | AGG | Ser (S) (p) | Arg (R) (b) | |
Echinoderm and flatworm mitochondrial | 9 | TGA | UGA | Trp (W) (np) | Stop * | |
Euplotid nuclear | 10 | TGA | UGA | Cys (C) (p) | Stop * | |
Bacterial, archaeal and plant plastid | 11 | | | | | See translation table 1. |
Alternative yeast nuclear | 12 | CTG | CUG | Ser (S) (p) | Leu (L) (np) | |
Ascidian mitochondrial | 13 | AGA | AGA | Gly (G) (np) | Arg (R) (b) | |
Ascidian mitochondrial | 13 | AGG | AGG | Gly (G) (np) | Arg (R) (b) | |
Ascidian mitochondrial | 13 | ATA | AUA | Met (M) (np) | Ile (I) (np) | |
Ascidian mitochondrial | 13 | TGA | UGA | Trp (W) (np) | Stop * | |
Alternative flatworm mitochondrial | 14 | AAA | AAA | Asn (N) (p) | Lys (K) (b) | |
Alternative flatworm mitochondrial | 14 | AGA | AGA | Ser (S) (p) | Arg (R) (b) | |
Alternative flatworm mitochondrial | 14 | AGG | AGG | Ser (S) (p) | Arg (R) (b) | |
Alternative flatworm mitochondrial | 14 | TAA | UAA | Tyr (Y) (p) | Stop * | |
Alternative flatworm mitochondrial | 14 | TGA | UGA | Trp (W) (np) | Stop * | |
Blepharisma nuclear | 15 | TAG | UAG | Gln (Q) (p) | Stop * | As of Nov. 18, 2016: absent from the NCBI update. Similar to translation table 6. |
Chlorophycean mitochondrial | 16 | TAG | UAG | Leu (L) (np) | Stop * | |
Trematode mitochondrial | 21 | TGA | UGA | Trp (W) (np) | Stop * | |
Trematode mitochondrial | 21 | ATA | AUA | Met (M) (np) | Ile (I) (np) | |
Trematode mitochondrial | 21 | AGA | AGA | Ser (S) (p) | Arg (R) (b) | |
Trematode mitochondrial | 21 | AGG | AGG | Ser (S) (p) | Arg (R) (b) | |
Trematode mitochondrial | 21 | AAA | AAA | Asn (N) (p) | Lys (K) (b) | |
Scenedesmus obliquus mitochondrial | 22 | TCA | UCA | Stop * | Ser (S) (p) | |
Scenedesmus obliquus mitochondrial | 22 | TAG | UAG | Leu (L) (np) | Stop * | |
Thraustochytrium mitochondrial | 23 | TTA | UUA | Stop * | Leu (L) (np) | Similar to translation table 11. |
Pterobranchia mitochondrial | 24 | AGA | AGA | Ser (S) (p) | Arg (R) (b) | |
Pterobranchia mitochondrial | 24 | AGG | AGG | Lys (K) (b) | Arg (R) (b) | |
Pterobranchia mitochondrial | 24 | TGA | UGA | Trp (W) (np) | Stop * | |
Candidate division SR1 and Gracilibacteria | 25 | TGA | UGA | Gly (G) (np) | Stop * | |
Pachysolen tannophilus nuclear | 26 | CTG | CUG | Ala (A) (np) | Leu (L) (np) | |
Karyorelict nuclear | 27 | TAA | UAA | Gln (Q) (p) | Stop * | |
Karyorelict nuclear | 27 | TAG | UAG | Gln (Q) (p) | Stop * | |
Karyorelict nuclear | 27 | TGA | UGA | Stop * or Trp (W) (np) | Stop * | |
Condylostoma nuclear | 28 | TAA | UAA | Stop * or Gln (Q) (p) | Stop * | |
Condylostoma nuclear | 28 | TAG | UAG | Stop * or Gln (Q) (p) | Stop * | |
Condylostoma nuclear | 28 | TGA | UGA | Stop * or Trp (W) (np) | Stop * | |
Mesodinium nuclear | 29 | TAA | UAA | Tyr (Y) (p) | Stop * | |
Mesodinium nuclear | 29 | TAG | UAG | Tyr (Y) (p) | Stop * | |
Peritrich nuclear | 30 | TAA | UAA | Glu (E) (a) | Stop * | |
Peritrich nuclear | 30 | TAG | UAG | Glu (E) (a) | Stop * | |
Blastocrithidia nuclear | 31 | TAA | UAA | Stop * or Glu (E) (a) | Stop * | |
Blastocrithidia nuclear | 31 | TAG | UAG | Stop * or Glu (E) (a) | Stop * | |
Blastocrithidia nuclear | 31 | TGA | UGA | Trp (W) (np) | Stop * | |
Cephalodiscidae mitochondrial code | 33 | AGA | AGA | Ser (S) (p) | Arg (R) (b) | Similar to translation table 24. |
Cephalodiscidae mitochondrial code | 33 | AGG | AGG | Lys (K) (b) | Arg (R) (b) | |
Cephalodiscidae mitochondrial code | 33 | TAA | UAA | Tyr (Y) (p) | Stop * | |
Cephalodiscidae mitochondrial code | 33 | TGA | UGA | Trp (W) (np) | Stop * | |
Enterosoma | 34 | AGG | AGG | Met (M) (np) | Arg (R) (b) | |
Peptacetobacter | 35 | CGG | CGG | Gln (Q) (p) | Arg (R) (b) | |
Anaerococcus and Onthovivens | 36 | CGG | CGG | Trp (W) (np) | Arg (R) (b) | |
Absconditabacteraceae | 37 | CGA | CGA | Trp (W) (np) | Arg (R) (b) | |
Absconditabacteraceae | 37 | CGG | CGG | Trp (W) (np) | Arg (R) (b) | |
Absconditabacteraceae | 37 | TGA | UGA | Gly (G) (np) | Stop * | |
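One way to work with these alternative codes is to store each one as a small set of differences from the standard code. The Python sketch below does this for three of the tables listed above (2, 6, and 25). The names `STANDARD`, `OVERRIDES`, `table_for`, and `translate` are hypothetical; the sketch keeps stop codons in the output as `*` rather than terminating, and it does not model codes with context-dependent readings such as tables 27, 28, and 31.

```python
from itertools import product

# Standard code in UCAG order (UUU, UUC, UUA, UUG, UCU, ...); * marks a stop.
BASES = "UCAG"
STANDARD = dict(zip(
    ("".join(codon) for codon in product(BASES, repeat=3)),
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG",
))

# Differences from the standard code, copied from the rows of the table above.
OVERRIDES = {
    2: {"AGA": "*", "AGG": "*", "AUA": "M", "UGA": "W"},  # vertebrate mitochondrial
    6: {"UAA": "Q", "UAG": "Q"},                          # ciliate nuclear
    25: {"UGA": "G"},                                     # SR1 and Gracilibacteria
}


def table_for(number):
    """Return the full codon table for a translation table number."""
    table = dict(STANDARD)
    table.update(OVERRIDES.get(number, {}))
    return table


def translate(mrna, number=1):
    """Translate every complete codon; stop codons appear as * in the result."""
    table = table_for(number)
    return "".join(table[mrna[i:i + 3]] for i in range(0, len(mrna) - 2, 3))


print(translate("AUGAUAUGAAGA"))     # standard code: MI*R
print(translate("AUGAUAUGAAGA", 2))  # vertebrate mitochondrial: MMW*
```

The same sequence gives a different product under each code, which is why the translation table must be chosen to match the genetic source.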