C16orf46 Explained

Chromosome 16 open reading frame 46 is a protein of undetermined function in Homo sapiens. It is encoded by the C16orf46 gene (NCBI accession number NM_001100873). C16orf46 is a protein-coding gene with an overlapping locus.

Gene

An alternative name for this gene is FLJ32702; however, it is most commonly referred to as C16orf46.[1]

Location

The C16orf46 gene is found on the negative strand of chromosome 16 at 16q23.2. The promoter region is 1152 base pairs long.[2] The gene has three exons: the first spans 1 to 380 bp, the second 381 to 1254 bp, and the third 1255 to 1982 bp.[3]
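The exon coordinates above can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the source does not state explicitly; the names `EXONS`, `exon_lengths`, and `is_contiguous` are illustrative only.

```python
# Exon coordinates as (start, end) pairs taken from the text.
# Assumption: positions are 1-based and inclusive.
EXONS = [(1, 380), (381, 1254), (1255, 1982)]


def exon_lengths(exons):
    """Return the length in bp of each inclusive (start, end) interval."""
    return [end - start + 1 for start, end in exons]


def is_contiguous(exons):
    """Return True if every exon starts one position after the previous one ends."""
    return all(nxt[0] == cur[1] + 1 for cur, nxt in zip(exons, exons[1:]))


lengths = exon_lengths(EXONS)
print(lengths)               # [380, 874, 728]
print(sum(lengths))          # 1982
print(is_contiguous(EXONS))  # True
```

Because the three intervals abut with no gaps, the coordinates appear to be given relative to the spliced transcript rather than the genome, although the source does not say so.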

Expression

C16orf46 is broadly expressed in the testis and thyroid as well as 18 other tissues.[4] Expression in these tissues is low to moderate (25-50%).[5] In tissue profiles, the highest expression is in the adult mammalian kidney, liver, prefrontal cortex, cerebellum, heart, and brain.[6]

Protein

Protein Analysis

The full C16orf46 protein is 417 amino acids long.[7] It has no isoforms, and its most distant ortholog, Rhincodon typus (whale shark), also has no known isoforms.[8] The molecular weight was found to be 45.8 kDa.[9] The isoelectric point is 7.4, average for all proteins, and C16orf46 is electrically neutral.[10]
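The reported molecular weight can be sanity-checked against the protein length. The sketch below uses the common rule of thumb of roughly 110 Da per amino acid residue; that figure is an approximation introduced here, not a value from the cited tools, and `approx_mass_kda` is a hypothetical helper name.

```python
# Assumption: average residue mass of about 110 Da, a widely used approximation.
AVERAGE_RESIDUE_MASS_DA = 110.0


def approx_mass_kda(n_residues, avg_residue_mass=AVERAGE_RESIDUE_MASS_DA):
    """Rough protein mass in kDa from residue count alone."""
    return n_residues * avg_residue_mass / 1000.0


estimate = approx_mass_kda(417)  # 417 residues, as stated in the text
print(f"{estimate:.1f} kDa")     # 45.9 kDa
```

The estimate lands close to the reported 45.8 kDa, which suggests the reported value is in line with the sequence length. A composition-aware calculation from the actual sequence would be needed for anything more precise.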

C16orf46 is predicted to localize to the nucleus in all orthologs.[11]

The secondary structure of C16orf46 has alternating alpha helices and beta sheets.[12]

Protein Level Regulation

C16orf46 is predicted to undergo N-linked glycosylation, O-linked glycosylation, and SUMOylation.[13][14]

Phosphorylation sites are predicted for the kinases CKII, CKI, PKC, and cdc2.[15]

A coronavirus cleavage site is predicted at amino acid position 235.[16] There are also tyrosine motifs located at amino acids 42–45 and 251–252.[17]

Transcript Level Regulation

Predicted mRNA folding of the 5' UTR shows a stem-loop twice in the region between base pairs 47 and 90.[18]

Homologs

Orthologs

C16orf46 has over 50 orthologs ranging from primate to chordate.[19] The table below illustrates the diversity of C16orf46 by listing a selection of orthologs found using NCBI. When the Homo sapiens C16orf46 sequence was run through a multiple sequence alignment program, Clustal Omega, against 20 true orthologs and 16 distant orthologs, Trp74 and Pro212 were found to be conserved in all.[20]

Species                   Common Name             Divergence (MYA)  Accession Number  Identity
Homo sapiens              Humans                  ---               XP_016878405.1    100.0%
Ochotona princeps         American Pika           90                XP_004584265.1    52.7%
Octodon degus             Common Degu             90                XP_003434773.2    47.8%
Ursus maritimus           Polar Bear              96                XP_008687958.1    67.5%
Leptonychotes weddellii   Weddell Seal            96                XP_006748170.1    67.2%
Canis lupus               Gray Wolf               96                XP_003434773.2    65.8%
Pteropus vampyrus         Large Flying Fox        96                XP_011354946.1    63.5%
Sus scrofa                Wild Boar               96                XP_020952705.1    61.5%
Bos indicus               Zebu                    96                XP_019835282.1    60.2%
Erinaceus europaeus       European Hedgehog       96                XP_007516703.1    56.7%
Loxodonta africana        African Bush Elephant   105               XP_010596137.1    60.9%
Sarcophilus harrisii      Tasmanian Devil         159               XP_003757901.1    43.1%
Apteryx australis         Southern Brown Kiwi     312               XP_013796688.1    18.5%
Aptenodytes forsteri      Emperor Penguin         312               XP_019327074.1    17.4%
Chelonia mydas            Green Sea Turtle        312               XP_007059324.1    29.7%
Gekko japonicus           Gekko Japonicus         312               XP_015261305.1    25.3%
Nanorana parkeri          High Himalaya Frog      352               XP_018410908.1    22.4%
Pygocentrus nattereri     Red Bellied Piranha     435               XP_017578196.1    21.2%
Lepisosteus oculatus      Spotted Gar             435               XP_015223705.1    20.6%
Callorhinchus milii       Australian Ghost Shark  473               XP_007887408.1    22.7%
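The conservation check described above, in which residues such as Trp74 and Pro212 are identical across every aligned sequence, amounts to scanning the columns of a multiple sequence alignment. The following is a minimal sketch of that idea; the function name `conserved_columns` is hypothetical, and the toy alignment is made up for illustration and is not real C16orf46 sequence.

```python
def conserved_columns(aligned):
    """Return (1-based column, residue) pairs where all sequences share one non-gap residue."""
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    hits = []
    for index, column in enumerate(zip(*aligned), start=1):
        residues = set(column)
        # A column is conserved only if it holds a single residue and no gap.
        if len(residues) == 1 and "-" not in residues:
            hits.append((index, column[0]))
    return hits


# Made-up toy alignment for illustration only ("-" marks a gap).
toy = [
    "MKW-PLA",
    "MRWSPLG",
    "MKWTP-A",
]
print(conserved_columns(toy))  # [(1, 'M'), (3, 'W'), (5, 'P')]
```

Note that the indices returned are alignment columns. Reporting a residue as, say, Trp74 requires mapping the column back to the ungapped position in the human sequence.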

Paralogs

C16orf46 has no known paralogs.

Mutations

C16orf46 has been compared against fibrinogen, a protein which mutates rapidly, and cytochrome c, a protein which mutates slowly.

When multiple species of the three proteins were plotted, C16orf46 more closely resembled fibrinogen than cytochrome c, suggesting that it may mutate relatively rapidly.
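The source does not say how the comparison was computed. One common approach, sketched here as an assumption, is to convert percent identity into a Poisson-corrected divergence, m = -ln(1 - d), where d is the observed fraction of differing sites, and then relate m to divergence time. The values below are copied from the ortholog table; the name `corrected_divergence` is illustrative.

```python
import math

# (species, divergence from human in MYA, percent identity), from the ortholog table.
ORTHOLOGS = [
    ("Ochotona princeps", 90, 52.7),
    ("Ursus maritimus", 96, 67.5),
    ("Loxodonta africana", 105, 60.9),
    ("Sarcophilus harrisii", 159, 43.1),
    ("Chelonia mydas", 312, 29.7),
    ("Nanorana parkeri", 352, 22.4),
    ("Pygocentrus nattereri", 435, 21.2),
    ("Callorhinchus milii", 473, 22.7),
]


def corrected_divergence(percent_identity):
    """Poisson-corrected divergence m = -ln(1 - d), d = fraction of differing sites."""
    d = 1.0 - percent_identity / 100.0
    return -math.log(1.0 - d)


for species, mya, identity in ORTHOLOGS:
    m = corrected_divergence(identity)
    print(f"{species:<24} {mya:>4} MYA  m = {m:.2f}")
```

Plotting m against divergence time for C16orf46 alongside the same quantity for fibrinogen and cytochrome c would give the kind of comparison described above; a steeper slope indicates faster change. Data for the two reference proteins are not given in the source and are deliberately left out here.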

Interacting Proteins

C16orf46 interacts with FAT3, which has been linked to neurite interactions during development.[21] C16orf46 is thought to be coexpressed with the PLAC8L1 and CFAP43 genes, both of unknown function.[22]

Clinical Significance

C16orf46 expression is higher in pancreatic adenocarcinoma tumor epithelial tissue than in the control.[23] Gene expression is also higher in patients with small-cell carcinoma than in the control.[24]

Notes and References

  1. C16orf46 Gene - GeneCards CP046 Protein CP046 Antibody. GeneCards Human Gene Database. www.genecards.org. Retrieved 2018-05-07.
  2. Genomatix - NGS Data Analysis & Personalized Medicine. www.genomatix.de. Retrieved 2018-05-07. Archived 2001-02-24 at https://web.archive.org/web/20010224072831/http://www.genomatix.de/.
  3. Gene: C16orf46 (OTTHUMG00000137629) - Summary - Homo sapiens - Vega Genome Browser 68. vega.archive.ensembl.org. Retrieved 2018-05-07.
  4. C16orf46 Symbol Report HUGO Gene Nomenclature Committee. www.genenames.org. Retrieved 2018-05-01.
  5. Home - GEO - NCBI. www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  6. Gene: C16orf46 - ENSG00000166455. bgee.org. Retrieved 2018-05-07.
  7. uncharacterized protein C16orf46 isoform X1 [Homo sapiens] - Protein - NCBI. www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  8. uncharacterized protein C16orf46 homolog [Rhincodon typus] - Protein - NCBI. www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  9. CALCULATION OF PROTEIN ISOELECTRIC POINT. Kozlowski, Lukasz P. isoelectric.org. Retrieved 2018-05-07.
  10. SAPS < Sequence Statistics < EMBL-EBI. EMBL-EBI. www.ebi.ac.uk. Retrieved 2018-05-06.
  11. PSORT WWW Server. psort.hgc.jp. Retrieved 2018-05-07.
  12. Bioinformatics Toolkit. toolkit.tuebingen.mpg.de. Retrieved 2018-05-07.
  13. NetNGlyc 1.0 Server. www.cbs.dtu.dk. Retrieved 2018-05-07.
  14. NetOGlyc 4.0 Server. www.cbs.dtu.dk. Retrieved 2018-05-07.
  15. NetPhos 3.1 Server. www.cbs.dtu.dk. Retrieved 2018-05-07.
  16. NetCorona 1.0 Server. www.cbs.dtu.dk. Retrieved 2018-05-07.
  17. Human Protein Reference Database. www.hprd.org. Retrieved 2018-05-07. Archived 2006-04-24 at https://web.archive.org/web/20060424071622/http://www.hprd.org/.
  18. The Mfold Web Server mfold.rit.albany.edu. unafold.rna.albany.edu. Retrieved 2018-05-07.
  19. BLAST: Basic Local Alignment Search Tool. blast.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  20. Clustal Omega < Multiple Sequence Alignment < EMBL-EBI. EMBL-EBI. www.ebi.ac.uk. Retrieved 2018-05-07.
  21. BioGRID Database of Protein, Chemical, and Genetic Interactions. Mike Tyers Lab. thebiogrid.org. Retrieved 2018-05-07.
  22. C16orf46 protein (human) - STRING interaction network. string-db.org. Retrieved 2018-05-07.
  23. GDS4103 / 230281_at. www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.
  24. GDS4794 / 230281_at. www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.