C5orf34 Explained

C5orf34 (chromosome 5 open reading frame 34) is a protein that in humans is encoded by the C5orf34 gene (5p12).[1] [2]

C5orf34 is conserved in mammals, birds and reptiles with the most distant ancestor being the Burmese python, Python bivittatus. The C5orf34 protein contains two mammalian conserved domains: DUF 4520 and DUF 4524. The protein is also predicted to have a polo-box domain (PBD) of polo-like kinase 4 (plk4), which has predicted conservation in distant orthologs from the clade Aves.[3] [4]

Gene

C5orf34 is located on the negative DNA strand of the short arm of chromosome 6 at locus 12. The gene is 28,744 base pairs long and spans from base pair 43,486,701 to base pair 43,515,445. The gene produces a single transcript of 2,540 base pairs long and encodes for 638 amino acids.[5]

Gene neighborhood

The gene PAIP1 is found on the negative strand just downstream of C5orf34 and is a member of the polyadenylate-binding family. PAIP1 extends from base pairs 43,526,267 to 43,557,419.[6] CCL28 is found downstream on the negative strand and extends from base pairs 43378052 to 43413837.[7]

Gene expression

There indication of multiple sources that suggest, in humans, C5orf34 protein is expressed non-ubiquitously in select tissues at low/moderate levels, with the most abundant expression in the tissues of the stomach, small intestine, testis, skeletal muscle and heart muscle.[8] [9] A study of Rho kinase inhibitor effect on primary cell lines also showed that C5orf34 is expressed in dermal fibroblasts of normal human tissue samples.[10]

Promoter

The promoter region for C5orf34 is predicted to be between 43515079 and 43515773 and spans 695 base pairs.[11]

Protein

C5orf34 consists of 638 amino acids, has a weight of 72.7 kDa and an isoelectric point of 7.77 in humans.[12] [13]

Function

Although the precise function of C5orf34 in humans remains unknown, there is evidentiary support based on structure that it is involved in kinase-related cellular functions.[14] In addition, C5orf34 is predicted to be nuclear, thus it has potential involvement in gene regulation and cell proliferation seeing as these are two primary signal transduction pathways involve nuclear kinase proteins.[15] [16]

Structure

In humans, C5orf34 contains two domains of unknown function, DUF 4520 (pfam 15016) and DUF 4524 (pfam 150125), found between residues 6-153 and 444–539, respectively. The protein is serine and threonine rich. The charge distribution of the protein is equally dispersed per there are no positive or negative charge clusters sequestered within the protein.

The predicted secondary structures of the human protein were assessed by multiple bioinformatic tools. All of the programs predicted the protein's structure to consist of alpha helices, extended strands, random coils and beta turns. The Phyre2 server provided a predicted human protein structure that indicated domains of plk polo-box of the serine/threonine-protein kinase plk4. The server predicted with 96.8% confidence of 20% coverage (130 residues) of the protein. The coverage exhibited residues of the conserved polo-box domain and the two DUF domains. The protein was predominantly soluble, with an average hydrophobicity of -0.478.[17] [18]

Post-translational modifications

There is extensive, predicted phosphorylation of C5orf34, with 32 phosphoserines and 7 phosphothreonines being conserved in orthologs of the human C5orf34 protein. This analysis indicates C5orf34 as a phosphoprotein and supports structural predictions of it being a kinase protein. The protein contains only one nuclear export signal residue, found at 481-L; however the NES score was found to be low at 0.515. Structural analysis of the protein indicated it was sequestered in the nucleus with an 87% probability.[19] [20]

Interacting proteins

Databases of protein interactions (MINT, STRING, IntAct, and BioGRID) have not identified any interactions with C5orf34.

Homology and evolution

C5orf34 is highly conserved in primates and mammals and moderately conserved in reptiles. The furthest conserved ortholog is in Python bivittatus, or the Burmese python. Below is a selected list of orthologs to demonstrate the homology of this gene with relation to the reference sequence in Homo sapiens.

Orthologous space

151 organisms have been predicted orthologs with C5orf34. The most distant ortholog is the Burmese python, which diverged from humans 296 million years ago, indicating C5orf34 developed in reptiles and birds.[21]

Table of C5orf34 orthologs

Scientific NameCommon NameDate of Divergence from Humans (MYA)[22] NCBI Protein Accession #Protein Length (amino acids)Sequence Similarity (%)
Homo sapiensHuman0NP_001076895.1638100
Gorilla gorillaGorilla8.8XP_004058945.163692
Camelus ferusBactrian Camel97.4XP_006191979.164084
Panthera tigris altaicaSiberian Tiger97.4XP_007095478.163883
Sus scrofaWild Boar97.4XP_003133971.344180
Bos TarusCattle97.4NP_001076895.163880
Erinaceus europaeusEuropean Hedgehog97.4XP_007517686.163269
Mus MusculusHouse Mouse91BAE28742.138275
Monodelphis domesticaGray Short-tailed Opossum176.1XP_007487459.151262
Chelonia mydasGreen Turtle324.5XP_007052886.163851
Aptendodytes forsteriEmperor Penguin324.5XP_009272830.164748
Gallus gallusChicken324.5XP_424782.366948
Python bivittatusBurmese python324.5XP_007430528.164946

Paralogous space

There are no predicted paralogs for C5orf34 in both humans and mice.

Conserved regions

Multiple sequence alignments indicated amino acid residue conservation throughout the C5orf34 protein in an array of orthologs, with the most highly conserved regions at both N-terminus and C-terminus where the DUF are located. DUF 4520 (pfam 15016) was found to be conserved in C-terminus and DUF 4524 (pfam 150125) was found to be conserved in the N-terminus. Also, the polo-box domain of plk4 was found to be conserved in the C-terminus in a multiple sequence alignment in both strict and distant orthologs.

Notes and References

  1. Web site: NCBI Protein. www.ncbi.nlm.nih.gov. 2015-05-09.
  2. Web site: NCBI Gene. www.ncbi.nlm.nih.gov. 2015-05-09.
  3. Web site: NCBI Blast. www.ncbi.nlm.nih.gov. 2015-05-09.
  4. Polo-like kinase 4: the odd one out of the family. Cell Division. 2010-09-29. 1747-1028. 20920249. 25. 5. 1. 10.1186/1747-1028-5-25. James E.. Sillibourne. Michel. Bornens. 2955731 . free .
  5. Web site: Ensembl Genome Browser. www.ensembl.org. 2015-05-09.
  6. Web site: NCBI Gene. www.ncbi.nlm.nih.gov. 2015-05-09.
  7. Web site: NCBI Gene. www.ncbi.nlm.nih.gov. 2015-05-09.
  8. Web site: Tissue expression of C5orf34 - Summary - The Human Protein Atlas. www.proteinatlas.org. 2015-05-09.
  9. Web site: NCBI GeoProfile. www.ncbi.nlm.nih.gov. 2015-05-09.
  10. Comparative gene expression profiling in three primary human cell lines after treatment with a novel inhibitor of Rho kinase or atorvastatin. Blood Coagulation & Fibrinolysis. 2008. 10.1097/MBC.0b013e32830b2891. Boerma. Marjan. Fu. Qiang. Wang. Junru. Loose. David S.. Bartolozzi. Alessandra. Ellis. James L.. McGonigle. Sharon. Paradise. Elsa. Sweetnam. Paul. Fink. Louis M.. Vozenin-Brotons. Marie-Catherine. Hauer-Jensen. Martin. 19. 7. 709–718. 18832915. 2713681.
  11. Web site: Genomatix: Annotation & Analysis. www.genomatix.de. 2015-05-09.
  12. Web site: Statistical Analysis of PS (SAPS). 5 May 2015. Biology Workbench. Subramaniam, Shankar.
  13. Web site: ExPASy - Compute pI/Mw tool. web.expasy.org. 2015-05-09.
  14. Web site: Phyre Investigator output for C5orf34__ with c1umwB_. www.sbg.bio.ic.ac.uk. 2015-05-09.
  15. Nuclear protein kinases. Molecular and Cellular Biochemistry. 1984-03-01. 0300-8177. 81–99. 59. 1–2. 10.1007/BF00231306. 6323962. Harry R.. Matthews. Verena D.. Huebner. 25765323 .
  16. Web site: PSORT II server. www.genscript.com. 2015-05-09. 2021-07-09. https://web.archive.org/web/20210709182415/http://www.genscript.com/cgi-bin/tools/psort2.pl. dead.
  17. Web site: NPS@ : SOPMA secondary structure prediction. npsa-prabi.ibcp.fr. 2015-05-09. Institut. UCBL.
  18. Web site: PELE - Protein Energy Landscape Exploration - Web Server. pele.bsc.es. 2015-05-09. Armin. Sobhani.
  19. Web site: NetPhos 2.0 Server. www.cbs.dtu.dk. 2015-05-09.
  20. Web site: NetNES 1.1 Server. www.cbs.dtu.dk. 2015-05-09.
  21. Web site: CLUSTALW. 5 May 2015. SDSC. Subramaniam, Shankar.
  22. Web site: TimeTree :: The Timescale of Life. www.timetree.org. 2015-05-10.