MIF4GD, or MIF4G domain-containing protein, is a protein which in humans is encoded by the MIF4GD gene.[1] It is also known as SLIP1, SLBP (Stem-Loop Binding Protein)-interacting protein 1, AD023, and MIFD.[2] [3] MIF4GD is expressed ubiquitously in humans, and has been found to be involved in activating proteins for histone mRNA translation, alternative splicing and translation of mRNAs, and is a factor in the regulation of cell proliferation.[4] [5] [6]
The MIF4GD gene is located in humans on the minus strand of chromosome 17q25.1, and spans 5.0 Kb, from bases 75,266,228 to 75,271,292.
There are 11 alternatively-spliced mRNA transcripts and 3 unspliced mRNA transcripts that can be transcribed from this gene, which include 7 possible exons and 11 distinct introns.[7]
There are 10 viable isoforms of the MIF4G domain-containing protein. The longest isoform is MIF4G domain-containing protein isoform 1, which is 263 amino acids long, however, the most common isoform is MIF4G domain-containing protein isoform 4, which consists of 6 exons and is 222 amino acids in length.
MIF4G domain-containing protein isoform 1 has a predicted molecular weight of 30.1 kDa, and a predicted isoelectric point of 5.2, indicating that it is an acidic protein.[8] It has a normal ratio of each amino acid when compared to the average human protein.[9] Additionally, MIF4GD is expected to form 11 alpha helices.[10] [11] [12]
Searches of MIF4GD antibodies showed that MIF4GD is present in the cytoplasm and nucleoli of cells.[13] [14] Additionally, several bioinformatic programs predict human MIF4GD, as well as several of its orthologs, are present in the cytoplasm, nucleus and mitochondria of cells.[15]
Due to its presumed localization in the cytoplasm, it is predicted that MIF4GD could be phosphorylated, acetylated, ubiquitinated, or sumoylated. Additionally, MIF4GD is predicted to contain a "YinOYang" site at S61, which may be either O-GlcNAcylated or phosphorylated at different times for regulatory purposes.[16] It is not likely that the MIF4GD protein will be lipid-linked or glycosylated.[17] [18] [19]
The MIF4GD protein that contains an MIF4G domain, which is named after the middle domain of eukaryotic initiation factor 4G (eIF4G).[20] The MIF4G domain of the MIF4GD protein has a molecular weight of 17.0 kDa, and has a predicted isoelectric point of 5.7. Similar to the entire protein, it contains normal ratios of each amino acid relative to a reference of human proteins, however, it contains less negatively-charged amino acids and more positively-charged amino acids relative to the entire protein. The MIF4G domain is predicted to contain many alpha-helices and is thought to contain alpha-helical repeats.
MIF4GD is found only in animals, and is expressed ubiquitously in the body, though it has been discovered to be expressed at a somewhat higher rate in lymph nodes, bone marrow and testes.[21] MIF4GD is expressed at an average rate that is 1.7 times higher than the average gene.
The promoter region of MIF4GD is approximately 1137 nucleotide base pairs long, and is predicted to interact with various transcription factors.[22] The 5' untranslated region of MIF4GD mRNA transcripts is relatively short, at a length of around 137 nucleotides, and is predicted to form stem-loops and interior-loops to which RNA-binding proteins may bind.[23] [24] The 3' untranslated region is longer, at a length of approximately 510 nucleotides. The 3' UTR is also predicted to form stem-loops, interior-loops, and bulge-loops, as well as more complex secondary structures, and is predicted to bind to RNA-binding proteins and miRNAs at or near these sites.[25]
MIF4GD has been experimentally shown to bind to various other proteins, many of which play a role in alternative splicing of pre-mRNAs and translation of mRNAs into proteins.[26] It also is known to interact with eukaryotic translation initiation factors, RNA, and DNA to form a translation initiation complex. Some of the most notable proteins that interact with MIF4GD are:
ATP-dependent RNA helicases DDX19A and DDX19B,[27] which is involved in mRNA export from the nucleus and helicase activity by facilitating the disassociation of nuclear mRNA binding proteins and replacement with cytoplasmic mRNA binding proteins.[28]
Cap binding complex dependent translation initiation factor, or CTIF,[29] which is a paralog of MIF4GD. CTIF binds cotranscriptionally to the cap end of the nascent mRNA, and is involved in simultaneous editing and translation of mRNA that happens directly after export from the nucleus.[30]
Histone RNA hairpin-binding protein, or SLBP,[31] which is involved in histone pre-mRNA processing and movement of mRNAs from the nucleus to the cytoplasm of cells.[32]
Supervillin, or SVIL,[33] which is a peripheral membrane protein that forms a high-affinity link between the actin cytoskeleton and the membrane and contributes to myogenic membrane structure and differentiation.[34] Supervillin also regulates cell spreading and motility during the cell cycle.
MIF4GD also has been verified by two-hybrid bait-prey experiments to interact with NSP7ab, or Non-structural protein 7, of SARS-CoV.[35]
MIF4GD has several known functions, including the activation of proteins that bind histone mRNAs for translation and binding of mRNAs for alternative splicing and translation into proteins. Additionally, down-regulation of the SLIP1/MIF4GD gene and corresponding protein results in a reduced rate of histone mRNA translation and reduced cell viability. Therefore, it is speculated to be needed in eukaryotic cells in order to produce proteins and for cell proliferation.
MIF4GD has been shown to bind and stabilize p27kip1, which plays an important role in the regulating the cell cycle and in cancer progression. When bound to MIF4GD, the stabilized protein suppresses phosphorylation by CDK2 at T187, which controls the amount of cell proliferation in hepatocellular carcinoma (HCC). Regulation of this interaction is being studied as a potential therapeutic treatment for patients with hepatocellular carcinoma. This provides more evidence that MIF4GD helps regulate cell proliferation, and suggests MIF4GD may play a role in immune response.
MIF4GD is found in Animalia, and first appeared in Porifera, which diverged from Homo sapiens around 777 million years ago.[36] Relative to humans, this gene is highly conserved (>80% identity and >90% similarity) in mammals and reptiles, moderately conserved (>70% identity and >85% similarity) in chordates, and low levels of conservation (15-25% identity and 25-40% similarity) to the rest of Animalia. MIF4GD is not present in trichoplax, fungi, plants, protists, archaea or bacteria.[37]
There are currently 310 known and sequenced MIF4GD orthologs found in Animalia. A select number of these orthologs have been analyzed for estimated time of divergence (in millions of years), amino acid sequence identity to humans, and amino acid sequence similarity to humans. The results are shown in the table below:
Homo sapiens | Human | NP_001229430 | 0 | 100 | 100 | |
Pan paniscus | Bonobo | XP_034798762 | 6.4 | 100 | 100 | |
Mus musculus | House mouse | NP_001230513 | 89 | 93.2 | 97.7 | |
Vombatus ursinus | Common wombat | XP_027728462 | 160 | 91.0 | 95.9 | |
Ornithorhynchus anatinus | Platypus | XP_028912780 | 180 | 77.9 | 90.5 | |
Crocodylus porosus | Saltwater Crocodile | XP_019398085 | 318 | 85.1 | 91.4 | |
Gallus gallus | Chicken | XP_015150938 | 318 | 83.8 | 90.1 | |
Xenopus tropicalis | Tropical clawed frog | NP_001016440 | 351.7 | 74.4 | 84.8 | |
Danio rerio | Zebrafish | NP_001013302 | 433 | 73.9 | 86.0 | |
Rhincodon typus | Whale shark | XP_020392528 | 465 | 71.2 | 85.1 | |
Petromyzom marinus | Sea lamprey | XP_032832018 | 599 | 48.7 | 69.4 | |
Exaiptasia pallida | Pale anemone | XP_020912437 | 687 | 22.6 | 37.3 | |
Limulus polyphemus | Atlantic horseshoe crab | XP_013791968 | 736 | 22.5 | 39.5 | |
Parasteatoda tepidariorum | Common house spider | XP_015912223 | 736 | 19.5 | 33.9 | |
Drosophila virilis | Fruit fly | XP_015028674 | 736 | 16.2 | 29.6 | |
Temnothorax curvispinosus | Ant | XP_024872082 | 736 | 14.1 | 25.6 | |
Amphimedon queenslandica | Sponge | XP_011404567 | 777 | 20.4 | 39.6 |
MIF4GD has two known paralogs, which are PAIP1 and CTIF.[39] Both known paralogs have moderate to low conservation to MIF4GD, with less than 15% identity and between 20 and 25% similarity. However, both of these genes are predicted to have diverged before the evolution of orthologs, and scored E-values of nearly zero, indicating a significant relationship with MIF4GD.
MIF4GD is a slowly-evolving gene, with an approximate average of 75 amino acid changes per hundred amino acids per million years. Multiple sequence alignments of human MIF4GD and its orthologs showed two conserved amino acids throughout all sequences, which are Gly200 and Glu241.