C12orf60 Explained

Uncharacterized protein C12orf60 is a protein that in humans (Homo sapiens) is encoded by the C12orf60 gene. The gene is also known as LOC144608 or MGC47869. The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus.[1]

The C12orf60 mature mRNA transcript is 1139 nucleotides long[2] and encodes a protein containing 245 amino acids.[3] The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus, but its function is not yet well understood by the scientific community. The gene was listed as a potential biomarker for detecting the efficacy of allergen immunotherapy.[4]

The gene is highly expressed in the testes and colon, but it is also expressed in the kidney, breast carcinomas, brain, and various endocrine glands.[5]

Gene

Locus and size

C12rf60 is located on Chromosome 12 beginning at 14,803,572 bp and ending at 14,823,858 bp, spanning 20,287 base pairs It is located on the forward/positive strand between the 12p12.3 and 12p13.1 cytogenic bands.[6] Other genes that are within 100 kilobases of this gene include:[7]

Common aliases

C12orf60 is also known as LOC144608 and MGC47869.

mRNA

A total of 22 exons exist within the gene.5 From these exons, there are 13 transcript variants. 12 of these transcript variants are predicted, and only a further 7 of these are predicted to encode a protein. Furthermore, they are predicted to encode the same protein.

The notable features of the mRNA sequence include two polyadenylation signals in the 3' untranslated region (UTR), and it is the target of several RNA-binding proteins (RBP) including RBP-MBNL1 in the 5' UTR. A single intron splice site exists in the primary transcript, as does an upstream in-frame stop codon.

Protein

Composition

C12orf60 has a predicted isoelectric point of 8.19 and a molecular weight of 27.6 kilodaltons. Glycine and tyrosine residues are relatively less prevalent compared to other proteins in the human proteome, while methionine is more prevalent.

Topology

The protein product is predicted to have multiple α-helices, coiled coil, and one β-sheet. It is suggested that the protein does not contain transmembrane regions or helices, meaning that the protein is not anchored to the cell membrane nor an intracellular membrane like the Golgi apparatus.[8]

Conserved domains

In the predicted protein product, C12orf60 contains a conserved protein domain of 225 amino acids. This domain (DUF4533) is within in the pfam15047 family of proteins. Only one other gene is listed within this family: C12orf69, which is also known as SMOC3 (single-pass membrane protein with coiled-coil domains 3).[9]

Analysis of human C12orf60 and 9 of its orthologs reveals a highly conserved ERL motif starting at the 10th residue of the human protein sequence. It is not known whether this motif occurs in other proteins.

Other conserved residues are Asp25, Ser28, Phe37, Met41, Glu69, Leu85, Lys88, Leu143, Pro147, Ile148, Leu151, Gln164, Lys189, Leu191, Ala207, and Glu212, Leu225, and Lys227. Furthermore, these residues lie within DUF4533, suggesting that these conserved amino acids are important for the function of the domain. Also, the region between the 100 and 150 residues are not conserved. Thus, this region is not likely vital to the protein's function.

Post-translational modification

Since it is predicted that the protein product is intracellular, extracellular modifications are not predicted to occur on C12orf60. Other modifications such as acetylation, phosphorylation, picornaviral protease cleavage, sumoylation, and O-beta-GlcNAcylation are predicted to occur on C12orf60 as well as several of its orthologous proteins. There are two amino acids that serve as sites of both phosphorylation and O-beta-GlcNAcylation, which may indicate a site of protein activation or inactivation.

Subcellular localization

C12orf60 is predicted to be localized in the nucleus, cytoplasm, or outside the cell.[10] [11] [12] [13] [14] [15] However, current literature supports its localization in the nucleus.

Expression

Tissue expression

Expression of C12orf60 is regulated. The gene is highly expressed in the testes and colon, but it is also expressed in the kidney, breast carcinomas, various endocrine glands, and some regions of the brain.[16] [17] It is also expressed in the embryo body and fetus during development.

Transcriptional regulation

The promoter GXP_71811 regulates the expression of C12orf60. The promoter is 1373 base pairs long and is also located on the positive strand. There are over 400 transcription factors that are possible matches for binding to this promoter, including those of the SOX/SRY-sex/testis determining, human and murine ETS, and homeodomain transcription factors.[18]

Protein interactions

Rolland et al. found that C12orf60 interacts with BMP4 (bone morphogenetic protein 4).[19] BMP4 induces bone and cartilage formation. It also acts in mesoderm induction and fracture repair.

Several other proteins might also interact with C12orf60,[20] and some are predicted to be co-expressed with the protein.[21] Possible protein interactions include L3MBTL4, C3orf67, FAM78A, and PXDC1. Rats that overexpressed L3MBTL4 had higher blood pressure and heart rate.[22]

Proteins that are thought to be co-expressed alongside C12orf60 include ELMOD2, TTC30B, and BCDIN3D. ELMOD2 is thought to be involved in antiviral responses and causing familial idiopathic pulmonary fibrosis.[23] TTC30B is involved in the organelle biogenesis and maintenance pathway as well as intraflagellar transport. BCDIN3D is a methyltransferase and serves as a negative regulator of miRNA processing.[24] As there is no agreement from various sources on any protein-protein interaction, it is difficult to determine if any of these interactions actually occur.

Paralogs

There are no known paralogs to this gene within the human genome, and no paralogs of C12orf60 were found within the selected species that have a C12orf60 protein ortholog.

Orthologs

Many orthologs are found in mammals and a couple of bird species.

Table of Select C12orf60 Orthologs in Other Species Compared to Human C12orf60!Genus and Species!Common Name!Estimated Time Since LCA of Protein (MY)!Accession # (mRNA)!Accession # (Protein)!Corrected Protein Sequence IdentityDeviation (%)
Homo sapiensHumans0NM_175874.3NP_787070.20
Pongo abeliiSumatran orangutan15.76XM_002822980.2XP_002823026.12.94
Rhinopithecus bietiBlack snub-nosed monkey29.44XM_017873538.1XP_017729027.19.87
Cebus capucinus imitatorWhite-headed capuchin43.2XM_017528453.1XP_017383942.19.87
Saimiri boliviensis boliviensisBolivian squirrel monkey43.2XM_003934554.2XP_003934603.110.3
Propithecus coquereliCoquerel's sifaka74XM_012652172.1XP_012507626.128.6
Ceratotherium simum simumSouthern white rhinoceros96XM_004435503.2XP_004435560.133.1
Physeter catodonSperm whale96XM_007107758.1XP_007107820.1 35.4
Equus caballusHorse96XM_001497318.3XP_001497368.138.3
Miniopterus natalensisNatal long-fingered bat96XM_016197505.1XP_016052991.140.8
Felis catusDomestic cat96XM_003988472.3XP_003988521.141.4
Ovis ariesSheep96XM_004007552.3XP_004007601.143.9
Ursus maritimusPolar bear96XM_008707031.1XP_008705253.144.0
Choloepus hoffmanniHoffman's two-toed sloth105N/AN/A44.5
Canis lupus familiarisDog96XM_005637113.2XP_005637170.148.1
Erinaceus europaeusWestern European hedgehog96XM_007533153.1XP_007533215.148.8
Dasypus novemcinctusNine-banded armadillo105XM_004460316.1XP_004460373.156.0
Echinops telfairiSmall Madagascar hedgehog105XM_004713800.1XP_004713857.156.0
Ochotona princepsAmerican pika90XM_004592653.2XP_004592710.156.7
Myotis brandtiiBrandt's bat96XM_005866788.2XP_005866850.159.2
Mus musculusHouse mouse90NM_178776.3NP_848891.262.0
Rattus norvegicusNorway rat90NM_001037797.1NP_001032886.167.3
Sorex araneusEuropean shrew96XM_004611368.1XP_004611425.175.1
Leptosomus discolorCuckoo roller312XM_009947218.1XP_009945520.1129 (100%)
Melopsittacus undulatusBudgerigar312N/AN/A160 (100%)

Clinical significance

References in literature

The gene is within 1 Mb of SNPs that were associated with obesity, height, and weight.[25]

The gene was listed along with two other genes in a patent as a potential biomarker for detecting the efficacy of allergen immunotherapy. Specifically, detection of 3 copies of C12orf60 meant that immunotherapy was ineffective.

In one study, the gene was among several identified genes that were translocated in a single patient with recurrent acute lymphoblastic leukemia.[26] This translocation was associated with apoptosis and tumorigenesis.

Another study found that the gene is upregulated by at least 1.5 fold in cells that expressed Constitutive Myocyte Enhancer Factor 2 (MEF2CA).[27] MEF2CA is expressed naturally in the brain.

One study stated the gene contains a “perfect potential antioxidant protein 1 (ATOX1) DNA interaction site in the promoter region.”[28]

Notes and References

  1. Web site: AceView: Gene:C12orf60, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView.. Danielle. Thierry-Mieg. Jean. Thierry-Mieg. www.ncbi.nlm.nih.gov. 2017-02-19.
  2. Web site: Homo sapiens chromosome 12 open reading frame 60 (C12orf60), mRNA - Nucleotide - NCBI. www.ncbi.nlm.nih.gov. 2017-02-19.
  3. Web site: uncharacterized protein C12orf60 [Homo sapiens] - Protein - NCBI]. www.ncbi.nlm.nih.gov. 2017-02-19.
  4. Hiroi, T., & Okubo, K. (2010). U.S. Patent Application No. 13/498,267.
  5. Marchler-Bauer, A., Bo, Y., Han, L., He, J., Lanczycki, C. J., Lu, S., ... & Gwadz, M. (2016). CDD/SPARCLE: functional classification of proteins via subfamily domain architectures. Nucleic Acids Research, gkw1129.
  6. Web site: C12orf60 Gene. www.genecards.org. 2017-02-19.
  7. Web site: C12orf60 chromosome 12 open reading frame 60 [Homo sapiens (human)] - Gene - NCBI]. www.ncbi.nlm.nih.gov. 2017-04-30.
  8. Web site: ExPASy: SIB Bioinformatics Resource Portal - Categories. www.expasy.org. en-US. 2017-04-24.
  9. Web site: NCBI CDD Conserved Protein Domain DUF4533. www.ncbi.nlm.nih.gov. en. 2017-02-27.
  10. Web site: SLP-Local. sunflower.kuicr.kyoto-u.ac.jp. 2017-04-30. 2016-01-22. https://web.archive.org/web/20160122071222/http://sunflower.kuicr.kyoto-u.ac.jp/~smatsuda/slplocal.html. dead.
  11. Web site: Hum-mPLoc 2.0 server. www.csbio.sjtu.edu.cn. 2017-04-30.
  12. Web site: BaCelLo. gpcr.biocomp.unibo.it. 2017-04-30.
  13. Web site: CELLO:Subcellular Localization Predictive System. cello.life.nctu.edu.tw. 2017-04-30. dead. https://web.archive.org/web/20160304093942/http://cello.life.nctu.edu.tw/. 2016-03-04.
  14. Web site: Hslpred: A svm based method for the subcellular localization of human proteins. www.imtech.res.in. 2017-04-30.
  15. Web site: ESLPred2 : Improved version of ESLPred. www.imtech.res.in. 2017-04-30.
  16. Web site: Home - UniGene - NCBI. www.ncbi.nlm.nih.gov. 2017-04-24.
  17. Web site: Expression Atlas < EMBL-EBI. www.ebi.ac.uk. 2017-04-24.
  18. Web site: Genome Annotation and Browser. Genomatix. 2021-12-02. https://web.archive.org/web/20211202010908/https://www.genomatix.de/. dead.
  19. Rolland T, Taşan M, Charloteaux B, Pevzner SJ, Zhong Q, Sahni N, Yi S, Lemmens I, Fontanillo C, Mosca R, Kamburov A, Ghiassian SD, Yang X, Ghamsari L, Balcha D, Begg BE, Braun P, Brehme M, Broly MP, Carvunis AR, Convery-Zupan D, Corominas R, Coulombe-Huntington J, Dann E, Dreze M, Dricot A, Fan C, Franzosa E, Gebreab F, Gutierrez BJ, Hardy MF, Jin M, Kang S, Kiros R, Lin GN, Luck K, MacWilliams A, Menche J, Murray RR, Palagi A, Poulin MM, Rambout X, Rasla J, Reichert P, Romero V, Ruyssinck E, Sahalie JM, Scholz A, Shah AA, Sharma A, Shen Y, Spirohn K, Tam S, Tejeda AO, Trigg SA, Twizere JC, Vega K, Walsh J, Cusick ME, Xia Y, Barabási AL, Iakoucheva LM, Aloy P, De Las Rivas J, Tavernier J, Calderwood MA, Hill DE, Hao T, Roth FP, Vidal M . 6 . A proteome-scale map of the human interactome network . Cell . 159 . 5 . 1212–1226 . November 2014 . 25416956 . 4266588 . 10.1016/j.cell.2014.10.050 .
  20. Web site: STRING: functional protein association networks. string-db.org. 2017-04-24.
  21. Web site: GeneMANIA. genemania.org. 2017-04-24.
  22. Web site: OMIM Entry - * 617135 - L3MBT-LIKE 4; L3MBTL4. www.omim.org. en-us. 2017-04-24.
  23. Web site: ELMOD2 Gene. www.genecards.org. 2017-04-24.
  24. Web site: BCDIN3D Gene. www.genecards.org. 2017-04-24.
  25. Zhou L, Ji J, Peng S, Zhang Z, Fang S, Li L, Zhu Y, Huang L, Chen C, Ma J . A GWA study reveals genetic loci for body conformation traits in Chinese Laiwu pigs and its implications for human BMI . Mammalian Genome . 27 . 11–12 . 610–621 . December 2016 . 27473603 . 10.1007/s00335-016-9657-4 . 24327418 .
  26. Chen C, Bartenhagen C, Gombert M, Okpanyi V, Binder V, Röttgers S, Bradtke J, Teigler-Schlegel A, Harbott J, Ginzel S, Thiele R, Husemann P, Krell PF, Borkhardt A, Dugas M, Hu J, Fischer U . Next-generation-sequencing of recurrent childhood high hyperdiploid acute lymphoblastic leukemia reveals mutations typically associated with high risk patients . Leukemia Research . 39 . 9 . 990–1001 . September 2015 . 26189108 . 10.1016/j.leukres.2015.06.005 .
  27. Chan SF, Huang X, McKercher SR, Zaidi R, Okamoto SI, Nakanishi N, Lipton SA . Transcriptional profiling of MEF2-regulated genes in human neural progenitor cells derived from embryonic stem cells . Genomics Data . 3 . 24–27 . March 2015 . 25485232 . 4255278 . 10.1016/j.gdata.2014.10.022 .
  28. Muller PA, Klomp LW . ATOX1: a novel copper-responsive transcription factor in mammals? . The International Journal of Biochemistry & Cell Biology . 41 . 6 . 1233–6 . June 2009 . 18761103 . 10.1016/j.biocel.2008.08.001 .