Chromosome 12 Open Reading Frame 42 (C12orf42) is a protein-encoding gene in Homo sapiens.
The genomic location for this gene is as follows: starts at 103,237,591 bp and ends 103,496,010 bp.[1] The cytogenetic location for C12orf42 is 12q23.2. It is located on the negative strand[2]
Fifteen different mRNAs are made by transcription: fourteen alternative splice variants and one unspliced form.
The protein released by this gene is known as uncharacterized protein C12orf42. There are three isoforms for this protein produced by alternative splicing. The first isoform is a conical sequence. The second Isoform differs form the first in that it doesn't contain 1-95 aa in its sequence. The third isoform differs from the conical sequence in two ways:[3]
C12orf42 protein takes on several secondary structures, such as: alpha helices, beta sheets, and random coils. C12orf42 protein is a soluble.[4] Proteins that are soluble have a hydrophilic outside and hydrophobic interior .[5] Proteins with this type of structure are able to freely float inside a cell, due to the liquid composition of the cytosol.
C12orf42 is an intracellular protein. This is known by the lack of transmembrane domains or signal peptides. This suggests that it is predicted to be a nuclear protein, given the nuclear localization signal (NSL) found: PRDRRPQ at 292 aa and a bipartite KRLIKVCSSAPPRPTRR at 325 aa.
Predicted post-translation modification sites are seen below in the table. Nuclear proteins are known for having phosphorylation, acetylation, sumoylation, and O-GlcNAc as types of modifications:
Type of Modification | Amino Acid Position | |
---|---|---|
Phosphorylation | Ser44,Ser47,Ser58,Ser74,Ser113,Ser115,Ser118,Ser123,Ser130,Ser134,Ser135,Ser205,Ser210,Ser217, Ser226,Ser238, Ser302,Thr17,Thr45,Thr145,Thr150,Thr228, Thr240,Thr240,Thr291,Thr339,Thr344,Tyr124[6] | |
Acetylation | Ser2[7] | |
Sumoylation | IPIVS32-36[8] | |
O-GlcNAc | Thr45,Ser58,Ser130,Ser135,Ser205,Ser210, Ser217,Thr339[9] |
Microarray data shows expression of the C12orf42 gene in different tissues throughout the human body. There is high expression in the lymph node, spleen, and thymus.[10] There is significant expression in the brain, bladder, epididymis, and the helper T cell. Therefore, there is statistically significant expression of C12orf42 gene throughout the nervous system, immune system, and male reproductive system.
The table below shows the areas in the mouse brain where C12orf42 is expressed. The gene name for the mouse is 1700113H08Rik, it is the human homolog of C12orf42.[11] Area one and two of the brain manages body and skeletal movement. Areas three and four in the brain are for sensory functions; area four specializes in perception of smell. Area five in the brain functions in emotional learning and memory.
Location in mouse brain | Area in Brain[12] | < | -- Deleted image removed: --> |
---|---|---|---|
Area #1 | Crus 1, granular layer | ||
Crus 2, granular layer | |||
Paramedian lobule, molecular | |||
Area #2 | Paraflocculus, granular layer | ||
Flocculus, granular layer | |||
Area #3 | Field CA1, pyramidal layer | ||
Field CA2, pyramidal layer | |||
Field CA3, pyramidal layer | |||
Area #4 | Piriform area, pyramidal layer | ||
Piriform-amygdalar area, pyramidal layer | |||
Area #5 | Cortical amygdalar area, posterior part, lateral zone, layer 2 | ||
Cortical amygdalar area, posterior part, Imedial zone, layer 2 |
C12orf42 gene has only one other member in its gene family, this gene is known as Neuroligin 4, Y linked gene (NLGN4Y).[13]
C12orf42 orthologs are mostly mammals. One exception that was found is the Pelodiscus Sinensis or more commonly known as the Chinese soft-shell turtle.
The domain structure that is most important is DUF4607, it is conserved in the Eutheria clade in the Mammalia class. The order that it is conserved in is as follows: Artiodactyla, Carnivora, Chiroptera, Lagomorpha, Perissodactyla, Primates, Proboscidea, and Rodentia.