Uncharacterized protein C17orf50 is a protein which in humans is encoded by the C17orf50 gene.
The gene is located on the long arm of chromosome 17, on the forward strand,[1] at position 17q12. C17orf50 spans about 4,200 base pairs, from 35,760,897 to 35,765,079. In humans, the gene has three exons[3] and encodes a protein that is 174 amino acids in length.[2]
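The span quoted above can be checked directly from the two coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention, but not one stated in the source); the function name `span_length` is illustrative only.

```python
# Coordinates quoted in the text (chromosome 17, forward strand).
START = 35_760_897
END = 35_765_079


def span_length(start: int, end: int, inclusive: bool = True) -> int:
    """Return the number of bases covered by a start/end coordinate pair.

    inclusive=True assumes 1-based, fully closed coordinates. This is an
    assumption; the source does not state which convention it uses.
    """
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + (1 if inclusive else 0)


length_bp = span_length(START, END)
print(f"{length_bp:,} bp")
```

Under either convention the result is within a base of 4,183 bp, which rounds to the 4,200 base pairs given in the text.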
The promoter region of C17orf50 is 1,417 base pairs long (Genomatix accession number GXP_123003).[4] The first half of the promoter is poorly conserved, even among primates.[5] [6]
There are many binding sites for transcription factors found in the brain and embryonic tissue, particularly Brn-5 POU domain factor, which has three binding sites within the conserved region of the promoter. This transcription factor is expressed in layer IV of the neocortex of adults and at its highest levels in the developing brain and spinal cord.[7]
Orthologs of this gene are found predominantly in mammals,[8] although some homologs are also present in birds, reptiles, and amphibians. There are no known paralogs of this gene. The table below lists selected orthologs that trace the evolutionary history of C17orf50.
Species | Accession number | Divergence from humans (MYA)[9] | Identity
---|---|---|---
Homo sapiens | NP_660315 | 0 | 100%
Chlorocebus sabaeus | XP_008009267 | 29.44 | 85%
Mus musculus | NP_079768.2 | 90 | 68%
Pteropus vampyrus | XP_011385558 | 171 | 70%
Chelonia mydas | EMP28888 | 312 | 45%
Corvus brachyrhynchos | XP_017584321 | 312 | 44%
Anolis carolinensis | XP_003218353 | 312 | 37%
Xenopus tropicalis | OCA35560 | 352 | 46%
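One way to read the table is to average the identity values at each divergence time. The sketch below copies the values from the table as given; the grouping function `mean_identity_by_divergence` is a hypothetical helper for illustration, not part of any cited analysis.

```python
from collections import defaultdict
from statistics import mean

# (species, divergence from humans in MYA, percent identity), copied from the table.
ORTHOLOGS = [
    ("Homo sapiens", 0, 100),
    ("Chlorocebus sabaeus", 29.44, 85),
    ("Mus musculus", 90, 68),
    ("Pteropus vampyrus", 171, 70),
    ("Chelonia mydas", 312, 45),
    ("Corvus brachyrhynchos", 312, 44),
    ("Anolis carolinensis", 312, 37),
    ("Xenopus tropicalis", 352, 46),
]


def mean_identity_by_divergence(rows):
    """Group rows by divergence time and average the percent identity."""
    groups = defaultdict(list)
    for _species, mya, identity in rows:
        groups[mya].append(identity)
    # Sort by divergence time so the summary reads from recent to ancient.
    return {mya: mean(values) for mya, values in sorted(groups.items())}


summary = mean_identity_by_divergence(ORTHOLOGS)
for mya, identity in summary.items():
    print(f"{mya:>7} MYA  {identity:.1f}%")
```

With only eight species this is a rough summary rather than a statistical analysis, but it shows identity falling from 100% in humans to the 37–46% range in the non-mammalian species listed.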
C17orf50 is expressed at low levels in various tissues, such as lung, prostate, thymus, thyroid, trachea, small intestine, and stomach, and it is most highly expressed in the fetal brain.[10]
The unmodified molecular weight of the C17orf50 protein is 19.3 kilodaltons. The protein has a negatively charged, glutamate-rich cluster from position 21 to 52.[11] It contains three nuclear localization signals and no other retention signals, which strongly suggests that the protein is localized to the nucleus.[12]
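The reported molecular weight is consistent with a back-of-the-envelope estimate from the protein's length. The sketch below assumes an average residue mass of about 110 Da, a common rule of thumb that does not come from the source; an exact value would require the actual amino acid sequence.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average; an assumption, not from the source
PROTEIN_LENGTH_AA = 174          # length given in the text
REPORTED_MASS_KDA = 19.3         # unmodified molecular weight given in the text


def estimate_mass_kda(n_residues: int,
                      residue_mass_da: float = AVERAGE_RESIDUE_MASS_DA) -> float:
    """Approximate a protein's mass in kDa from its residue count."""
    return n_residues * residue_mass_da / 1000.0


estimate = estimate_mass_kda(PROTEIN_LENGTH_AA)
print(f"estimated {estimate:.2f} kDa vs reported {REPORTED_MASS_KDA} kDa")
```

The estimate of roughly 19.1 kDa lies close to the reported 19.3 kDa. The small difference is what one would expect from a generic average that ignores the real composition, such as the glutamate-rich region.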
Uncharacterized protein C17orf50 contains a domain of unknown function (DUF4673) from position 5 to 172, which makes up the majority of the protein.
Uncharacterized protein C17orf50 contains two potential sumoylation sites at K7 and K12.[13] [14] There are possible threonine and serine glycosylation sites throughout the protein.[15] Potential threonine, serine, and tyrosine phosphorylation sites are also present.[16]
Uncharacterized protein C17orf50 potentially interacts with zinc finger protein 587 (ZNF587),[17] [18] which is expressed throughout fetal tissue, including the brain.[19] ZNF587 is expected to regulate transcription.[20]