Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.
C16orf95 is a Homo sapiens gene oriented on the minus strand of chromosome 16. It is located on the cytogenic band 16q24.2 and spans 14.62 kilobases.[1] The gene contains 6 introns and 7 exons.
There are no known paralogs of C16orf95.
Orthologs of C16orf95 exist only in mammals (identified with BLAST).[2] The most distant orthologs are found in opossums and Tasmanian devils.
Genus and species | Common name | NCBI accession | Date of divergence | Sequence identity | |
Homo sapiens | Human | NP_001182053 | 0 mya | 100% | |
Pan paniscus | Bonobo | XP_008972565 | 6.2 mya | 92% | |
Gorilla gorilla gorilla | Gorilla | XP_004058157 | 8.3 mya | 95% | |
Nomascus leucogenys | White-cheeked gibbon | XP_003272503 | 19.3 mya | 88% | |
Mandrillus leucophaeus | Drill | XP_011827052 | 27.3 mya | 78% | |
Propithecus coquereli | Lemur | XP_012513111 | 77.1 mya | 62% | |
Tupaia chinensis | Tree shrew | XP_006152612 | 86.5 mya | 58% | |
Oryctolagus cuniculus | European rabbit | XP_008250325 | 90.1 mya | 56% | |
Mus musculus | Mouse | NP_083873 | 90.1 mya | 54% | |
Rattus norvegicus | Rat | XP_006222844 | 90.1 mya | 51% | |
Camelus bactrianus | Camel | XP_010966555 | 95 mya | 63% | |
Canis lupus familiaris | Dog | XP_005620646 | 95 mya | 63% | |
Equus caballus | Horse | XP_005608538 | 95 mya | 60% | |
Felis catus | Cat | XP_011288582 | 95 mya | 60% | |
Bos taurus | Cattle | XP_015331266 | 95 mya | 60% | |
Lipotes vexillifer | Yangtze river dolphin | XP_007468528 | 95 mya | 50% | |
Myotis lucifugus | Brown bat | XP_014318589 | 95 mya | 56% | |
Trichechus manatus latirostris | Manatee | XP_004377854 | 102 mya | 66% | |
Loxodonta africana | Elephant | XP_003418190 | 102 mya | 59% | |
Orycteropus afer afer | Aardvark | XP_007937409 | 102 mya | 54% | |
Monodelphis domestica | Opossum | XP_007477328 | 162.4 mya | 42% | |
Sarcophilus harrisii | Tasmanian devil | XP_012395810 | 162.4 mya | 41% |
There are three splice variants of C16orf95.[3] The longest transcript contains 1156 base pairs and 7 exons.[4] Compared to variant 1, the second transcript variant lacks exons 4 and 5.[5] This alternative splicing results in a frameshift of the 3' coding region, and a shorter, unique C-terminus. The third transcript variant lacks exons 4 and 5, and uses an alternate 5' exon and start codon.[6] The resulting peptide has unique N- and C-termini compared to variant 1.
Size (base pairs) | ||||
---|---|---|---|---|
Exon # | Variant 1 | Variant 2 | Variant 3 | |
1 | 330 | 330 | 334 | |
2 | 52 | 52 | 52 | |
3 | 126 | 126 | 126 | |
4 | 147 | – | – | |
5 | 37 | – | – | |
6 | 187 | 187 | 187 | |
7 | 277 | 278 | 278 | |
Total | 1,156 | 973 | 977 |
The 3' untranslated region of the C16orf95 mRNA contains binding sites for KH domain-containing, RNA-binding, signal transduction-associated protein 3 (KHDRBS3) within an internal loop structure. KHDRBS3 regulates mRNA splicing and may act as a negative regulator of cell growth.[7]
The expression of C16orf95 is not well characterized. However, it has been detected at low levels in the following tissue types: bone, brain, ear, eye, intestine, kidney, lung, lymph nodes, prostate, testes, tonsils, skin, and uterus.[8]
The longest isoform of the C16orf95 protein has 239 amino acids.[9] It has a conserved domain of unknown function spanning residues 76 to 239. C16orf95 has a calculated molecular weight of 26.5 kDa, and a predicted isoelectric point of 9.8. Compared to other human proteins, C16orf95 has more cysteine, arginine, and glutamine residues. It has fewer aspartate, glutamate, and asparagine. The high ratio of basic to acidic amino acids contributes to the protein's higher isoelectric point.
C16orf95 is predicted to have several alpha-helices in its C-terminus. This is true for the human and mouse proteins. The N-terminus does not have significant cross-program consensus for secondary structure.
The tools available at ExPASy were used to predict post-translational modification sites on C16orf95.[10] The following modifications are predicted: palmitoylation, phosphorylation, and O-linked glycosylation. Bolded residues in the table indicate sites that are conserved in more than one species.
Predicted modification | Sites - Homo sapiens | Sites - Mus musculus | Sites - Canis lupus familiaris | Tool | |
---|---|---|---|---|---|
Palmitoylation | C77, C80, C126, C178,C187 | C24, C41, C90 | C64, C113, C174 | CSS-Palm[11] | |
Phosphorylation | S6, S9, S53, T57, S68,S91, S111, T122, S166 | S30, S76, S89, S120, T134, S141 | S15, S35, T39, S153 | NetPhos 2.0[12] | |
O-β-GlcNAc | S4, S6, S9, T57, S111 | None | None | NetOGlyc 4.0[13] |
C16orf95 has a large number of amino acid changes over time, indicating it is a quickly evolving protein.
There are no proteins known to interact with C16orf95.
Deletions of C16orf95 have been associated with hydronephrosis, microcephaly, distichiasis, vesicoureteral reflux, and intellectual impairment.[14] [15] However, the deletions included coding regions of the following genes: F-box Protein 31 (FBXO31), Microtubule-Associated Protein 1 Light Chain 3 Beta (MAP1LC3B), and Zinc Finger CCHC Type 14 (ZCCHC14). The contributions of each of these genes to the observed phenotypes has yet to be scientifically determined.