Uncharacterized protein chromosome 7 open reading frame 61 is an asparagine-poor protein in humans encoded by the c7orf61 gene. The protein function is relatively unknown and is highly conserved in mammals.
C7orf61 is located on the reverse (or negative) DNA strand and is situated in chromosome 7 (7q22.1) from base pairs 100,456,615-100,464,271 - roughly 7,656 bp.[1] It has a total of 3 exons and lacks isoforms.
The mRNA is approximately 1019 bp and belongs to domain of unknown function 4703 (PFAM15775).[2]
In humans, the protein contains a total of 206 amino acids.[1] The protein's molecular weight is 23.71 kDa and its isoelectric point is 10.41.[3] DUF4703 is positioned 22-206aa of the protein.[1] Neither the gene or its protein has another known alias, but can be found with high affinity in several primate species.[4]
The amino acid composition of C7orf61 consists of high frequencies in leucine, serine, and charged valine.[5] The protein has an unusual low frequency in asparagine, making it an asparagine-deficient protein,[5] and contains higher frequencies of salt-bridge formations between glutamic acid, aspartic acid, lysine, and arginine.[5]
The consent within secondary structure prediction tools CFSSP,[6] SSPRED,[7] and GOR4 [8] is that the protein's secondary structure consist mainly of α-helices (51.2%), with significant amounts of coiling (38.2%) and smaller fragments of beta strands (10.3%).
C7orf61 has several post-translation modification sites, most of which involve serine/threonine kinases - protein kinase C, Casein kinase II, DNA-dependent protein kinase, and Cyclin-dependent kinase 1. It is predicted to contain a Biparte nuclear localization signal (NLS_BP), a leucine-rich variant domain (LRV), and bacterial Ig-like domain (BIG-1).[9]
C7orf61 does not contain any trans-membrane domains or signal peptides.[10] [11] The protein is predicted to be localized in the Mitochondria, with little indication of extracellular activity.[12] [13] Expanded analysis of amino acid sequence KFFRWVRRAWQRIISWVF
near the N-terminal suggests the presence of a mitochondrial targeting signal.[14]
C7orf61 has high levels of expression in the testis and lower levels in the brain and connective tissues.[15] Through the assessment of microarray experiments available on NCBI Geo, its inferred that c7orf61 is under negative regulation.[16]
C7orf61 does not have any paralogs. Analysis via NCBI tool BLASTt[17] found the gene to be highly conserved in mammals and could not be traced farther back than 160 MYA. The following table contains a list of orthologs found in several mammalian sub-classes - this is not a comprehensive list for the proteins orthology.
Species | Common Name | Divergence (MYA) | Accession number (from NCBI [18]) | Sequence Length | Percent Identity | Percent Similarity | |
---|---|---|---|---|---|---|---|
Homo sapiens | Human | 0 | NP_001004323.1 | 206 | 100% | 100% | |
Ceratotherium simum | White Rhinoceros | 96 | XP_014652622.1 | 204 | 64% | 81% | |
Canis lupus | Wolf | 96 | XP_008963687.1 | 203 | 64% | 78% | |
Bos taurus | Cow | 96 | XP_005225278.1 | 204 | 64% | 78% | |
Physeter catodon | Sperm Whale | 96 | XP_007110691.1 | 204 | 63% | 76% | |
Condylura cristata | Mole | 96 | XP_004691685.1 | 204 | 62% | 75% | |
Eptesicus fuscus | Brown bat | 96 | XP_008139625.1 | 213 | 61% | 75% | |
Elephantulus edwardii | Elephant shrew | 105 | XP_006896643.1 | 147 | 58% | 78% | |
Phascolarctos cinereus | Koala | 159 | XP_020834746.1 | 160 | 41% | 58% | |
Sarcophilus harrisii | Tasmanian Devil | 160 | XP_012404266.1 | 162 | 37% | 61% |