C3orf56 Explained

C3orf56 is a protein-coding gene found on chromosome 3. Although the structure and function of the protein are not well understood, the C3orf56 protein is known to be expressed exclusively in metaphase II oocytes and to degrade as the oocyte develops towards the blastocyst stage. This degradation suggests that the gene plays a role in the transition from the maternal to the embryonic genome and in embryonic genome activation.[1]

Gene

The C3orf56 gene is located at 3q21.3 on the plus strand and is 5,055 base pairs in length.[2] It spans base pairs 127,193,131 to 127,198,185.[3] C3orf56 has two aliases: FLJ40141 and LOC285311.[4]
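The stated length follows from the two coordinates, because both endpoints are counted as part of the gene. A minimal check, using only the positions quoted above (the variable names are illustrative):

```python
# Coordinates of C3orf56 on chromosome 3, as quoted in the text above.
start = 127_193_131
end = 127_198_185

# Both endpoints belong to the gene, so the span is inclusive.
gene_length = end - start + 1
print(gene_length)  # 5055
```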

Transcript

In the primary assembly, the transcript contains 2 exons and encodes a protein 242 amino acids in length.[5]

Protein

C3orf56 has a predicted molecular weight of 26 kDa[6] and a predicted isoelectric point of 8.48.[7]
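As a rough sanity check on the predicted weight, a protein's mass can be approximated from its length using an average residue mass of about 110 Da. This rule of thumb is an assumption introduced here for illustration; the cited tools sum the masses of the actual residues instead, so their result can differ slightly.

```python
# Rule-of-thumb estimate: an average amino acid residue weighs roughly 110 Da.
# This constant is an approximation, not a value taken from the cited tools.
AVERAGE_RESIDUE_MASS_DA = 110

protein_length_aa = 242  # length of the human C3orf56 protein

approx_mass_kda = protein_length_aa * AVERAGE_RESIDUE_MASS_DA / 1000
print(f"{approx_mass_kda:.1f} kDa")  # 26.6 kDa
```

The estimate lands close to the 26 kDa reported above, which is all this kind of check is meant to show.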

Primary Structure

C3orf56 is rich in proline, serine, and tryptophan.[8] It has one highly conserved internal repeat, with copies spanning G85 to P125 and G146 to P178.[9]

Secondary Structure

Only one highly conserved alpha-helical structure was predicted.[10][11]

Tertiary Structure

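The tertiary structure of C3orf56 was predicted with a confidence score (C-score) of -3.08.[12] On I-TASSER's scale, which typically runs from -5 to 2, this indicates a low-confidence model. The predicted structure is suggested to contain many hydrogen bonds and salt bridges.[13]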

Protein-Protein Interactions

C3orf56 is predicted to interact with tyrosine-protein kinase transmembrane receptor ROR2[14] and oocyte-expressed protein homolog (OOEP).[15]

Gene level regulation

Expression

C3orf56 has been shown to be over-expressed in the testes.[16] Relative to the expression of all other genes, C3orf56 is nearly absent from human tissues (excluding the ovary) compared with metaphase II oocytes.[17] Its expression is also somewhat lower at the 8-cell embryo stage, and significantly lower at the morula and blastocyst stages, than at the 1-cell, 2-cell, and 4-cell stages.[18]

RNA binding proteins and transcription factors

Predicted binding sites for RNA-binding proteins appear to be conserved within the 5' UTR, and the proteins involved have functions related to sex and development.[19] Many of the predicted transcription factors likewise have roles in development.[20]

Transcript level regulation

Predicted stem loops

Predicted stem loops appear to be slightly more prevalent in the 3' UTR sequence, while those in the 5' UTR sequence are more conserved.[21]

Protein level regulation

Post-translational modifications

C3orf56 has many predicted post-translational modifications. Predicted protein kinase C phosphorylation sites were found at S6-K8, K181-C183, and S228-R230.[22] Predicted casein kinase II phosphorylation sites were found at S109-S112, S213-E186, and S218-L220.[23] General phosphorylation sites were predicted at T3, S6, S21, S109, T160, and S227.[24] N-myristoylation sites were predicted at G2-E7, G26-S31, G38-S43, and G146-S151.[25]
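Pattern-based predictions of this kind usually work by matching short consensus motifs against the protein sequence. The sketch below illustrates the idea with the standard PROSITE consensus patterns for protein kinase C sites, casein kinase II sites, and N-myristoylation sites, translated into regular expressions. It is not the cited tool's implementation, the function name `scan` is hypothetical, and the peptide is a made-up toy sequence rather than the C3orf56 sequence.

```python
import re

# PROSITE-style consensus patterns written as regular expressions.
# Each pattern sits inside a lookahead so overlapping sites are all reported.
MOTIFS = {
    "PKC phosphorylation": r"(?=([ST].[RK]))",                # [ST]-x-[RK]
    "CK2 phosphorylation": r"(?=([ST]..[DE]))",               # [ST]-x(2)-[DE]
    "N-myristoylation": r"(?=(G[^EDRKHPFYW]..[STAGCN][^P]))",  # G-{EDRKHPFYW}-x(2)-[STAGCN]-{P}
}


def scan(sequence):
    """Map each motif name to a list of (first, last, matched_text) hits.

    Positions are 1-based and inclusive, matching the residue numbering
    style used in the text (for example S6-K8).
    """
    hits = {}
    for name, pattern in MOTIFS.items():
        hits[name] = [
            (m.start() + 1, m.start() + len(m.group(1)), m.group(1))
            for m in re.finditer(pattern, sequence)
        ]
    return hits


# Hypothetical toy peptide for illustration only, NOT the C3orf56 sequence.
toy = "MGSAGSTKSLEDGP"
for name, sites in scan(toy).items():
    print(name, sites)
```

Short consensus patterns like these match very often by chance, so hits are best treated as candidates rather than confirmed modification sites.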

Localization

A potential nuclear localization signal was found from P231 to R238, and the protein is predicted to localize to the nucleus.[26]

Evolutionary History

The C3orf56 gene appears to have first arisen approximately 102 million years ago; the most distantly related species in which an ortholog has been found is the aardvark (Orycteropus afer). Compared with the rates at which the proteins fibrinogen alpha and cytochrome c evolved, C3orf56 evolves very rapidly, although this relationship holds only for the more recently diverged species.
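The article does not state how the rate comparison was made. One common approach, assumed here purely for illustration, is to convert percent identity into a Poisson-corrected distance, d = -ln(identity / 100), and relate it to the date of divergence; a protein whose distance rises more steeply over time is evolving faster. The sketch below applies this to a few identity values from Table 1. Values for fibrinogen alpha and cytochrome c are not given in the article, so none are included.

```python
import math

# (species, date of divergence in MYA, % sequence identity to human),
# copied from Table 1 of this article.
orthologs = [
    ("Rhinopithecus bieti", 29, 87),
    ("Saimiri boliviensis boliviensis", 43, 76),
    ("Galeopterus variegatus", 83, 57),
    ("Orycteropus afer", 102, 45),
]


def corrected_divergence(percent_identity):
    """Poisson-corrected distance: estimated substitutions per site."""
    return -math.log(percent_identity / 100)


for species, mya, identity in orthologs:
    d = corrected_divergence(identity)
    print(f"{species}: d = {d:.3f}, d per million years = {d / mya:.4f}")
```

The correction matters because raw percent identity understates change at older divergence dates, where the same site may have been substituted more than once.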

Homology

C3orf56 has numerous orthologs, all of which are found in placental mammals.[27] No paralogs of C3orf56 are currently known.

Table 1. Orthologs of C3orf56

| Genus and Species | Common Name | Taxonomic Order | Date of Divergence (MYA) | Accession Number | Sequence Length (aa) | % Sequence Identity | % Sequence Similarity |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Homo sapiens | Human | Primate | 0 | NP_001007535.1 | 242 | 100 | 100 |
| Rhinopithecus bieti | Black Snub-nosed Monkey | Primate | 29 | XP_017740448.1 | 248 | 87 | 89 |
| Saimiri boliviensis boliviensis | Bolivian Squirrel Monkey | Primate | 43 | XP_003944171.1 | 249 | 76 | 79 |
| Galeopterus variegatus | Flying Lemur | Dermoptera | 83 | XP_008592749.1 | 222 | 57 | 64 |
| Castor canadensis | Beaver | Rodentia | 89 | XP_020010355.1 | 204 | 50 | 56 |
| Jaculus jaculus | Lesser Egyptian Jerboa | Rodentia | 89 | XP_012805719.1 | 226 | 38 | 49 |
| Heterocephalus glaber | Naked Mole Rat | Rodentia | 89 | XP_012924683.1 | 264 | 38 | 49 |
| Mirounga leonina | Southern Elephant Seal | Carnivora | 94 | XP_034854170.1 | 178 | 57 | 64 |
| Phocoena sinus | Vaquita | Artiodactyla | 94 | XP_032503804.1 | 236 | 57 | 61 |
| Eumetopias jubatus | Northern Sea Lion | Carnivora | 94 | XP_027957377.1 | 195 | 54 | 64 |
| Canis lupus familiaris | Dog | Carnivora | 94 | XP_022263109.1 | 232 | 53 | 60 |
| Ailuropoda melanoleuca | Panda | Carnivora | 94 | XP_011228030.2 | 175 | 52 | 58 |
| Equus caballus | Horse | Perissodactyla | 94 | XP_014584106.2 | 240 | 49 | 58 |
| Sus scrofa | Wild Boar | Cetartiodactyla | 94 | XP_020926903.1 | 231 | 49 | 56 |
| Vicugna pacos | Alpaca | Artiodactyla | 94 | XP_031532218.1 | 366 | 49 | 56 |
| Camelus dromedarius | Arabian Camel | Artiodactyla | 94 | XP_031325652.1 | 233 | 45 | 53 |
| Enhydra lutris kenyoni | Sea Otter | Carnivora | 94 | XP_022377075.1 | 247 | 43 | 50 |
| Crocuta crocuta | Hyena | Carnivora | 94 | KAF0873334.1 | 216 | 39 | 46 |
| Bison bison bison | Bison | Artiodactyla | 94 | XP_010857995.1 | 273 | 34 | 21 |
| Monodon monoceros | Narwhal | Cetacea | 94 | TKC45752.1 | 265 | 33 | 43 |
| Orycteropus afer | Aardvark | Tubulidentata | 102 | XP_007951414.1 | 113 | 45 | 54 |

Notes and References

  1. Bouckenheimer J, Fauque P, Lecellier CH, Bruno C, Commes T, Lemaître JM, De Vos J, Assou S. Differential long non-coding RNA expression profiles in human oocytes and cumulus cells. Sci Rep. 2018 Feb 2;8(1):2202. doi: 10.1038/s41598-018-20727-0; PMCID: PMC5797088
  2. C3orf56 chromosome 3 open reading frame 56 Homo sapiens - Gene - NCBI. (2020, August 22). Retrieved September 30, 2020, from https://www.ncbi.nlm.nih.gov/gene/285311
  3. Database, G. (n.d.). C3orf56 Gene (Protein Coding). Retrieved September 30, 2020, from https://www.genecards.org/cgi-bin/carddisp.pl?gene=C3orf56
  4. Web site: C3orf56 Gene - GeneCards CC056 Protein CC056 Antibody. 2020-12-19. www.genecards.org.
  5. C3orf56 chromosome 3 open reading frame 56 [Homo sapiens (human)] - Gene - NCBI. (2020, August 22). Retrieved September 30, 2020, from https://www.ncbi.nlm.nih.gov/gene/285311
  6. Web site: SAPS < Sequence Statistics < EMBL-EBI. 2020-12-17. www.ebi.ac.uk.
  7. Web site: ExPASy - Compute pI/Mw tool. 2020-12-17. web.expasy.org.
  8. Web site: SAPS < Sequence Statistics < EMBL-EBI. 2020-12-17. www.ebi.ac.uk.
  9. Web site: Dotlet JS. 2020-12-17. dotlet.vital-it.ch.
  10. Web site: NPS@ : GOR4 secondary structure prediction. 2020-12-17. npsa-prabi.ibcp.fr.
  11. Web site: Prof. T. Ashok Kumar. CFSSP: Chou & Fasman Secondary Structure Prediction Server. 2020-12-17. www.biogem.org.
  12. Web site: I-TASSER results. 2020-12-19. zhanglab.ccmb.med.umich.edu.
  13. Web site: iCn3D: Web-based 3D Structure Viewer. 2020-12-19. www.ncbi.nlm.nih.gov.
  14. Web site: PSICQUIC View. 2020-12-18. www.ebi.ac.uk.
  15. Web site: STRING: functional protein association networks. 2020-12-18. string-db.org.
  16. Web site: C3orf56 chromosome 3 open reading frame 56 [Homo sapiens (human)] - Gene - NCBI. 2020-12-16. www.ncbi.nlm.nih.gov.
  17. Web site: 52817401 - GEO Profiles - NCBI. 2020-12-17. www.ncbi.nlm.nih.gov.
  18. Web site: 74223601 - GEO Profiles - NCBI. 2020-12-19. www.ncbi.nlm.nih.gov.
  19. Web site: RBPDB: The database of RNA-binding specificities. 2020-12-19. rbpdb.ccbr.utoronto.ca.
  20. Web site: Genomatix. 2020-12-19. Archived 2001-02-24 at https://web.archive.org/web/20010224072831/http://www.genomatix.de/.
  21. Web site: RNAfold web server. 2020-12-19. rna.tbi.univie.ac.at.
  22. Web site: Motif Scan. 2020-12-17. myhits.sib.swiss.
  23. Web site: Motif Scan. 2020-12-17. myhits.sib.swiss.
  24. Web site: NetPhos 3.1 Server. 2020-12-17. www.cbs.dtu.dk.
  25. Web site: Motif Scan. 2020-12-17. myhits.sib.swiss.
  26. Web site: PSORT II Prediction. 2020-12-19. psort.hgc.jp.
  27. BLAST: Basic Local Alignment Search Tool. (2020, September 29). Retrieved September 29, 2020, from https://blast.ncbi.nlm.nih.gov/Blast.cgi