C16orf82 Explained
C16orf82 is a protein that, in humans, is encoded by the C16orf82 gene.[1] C16orf82 encodes a 2285 nucleotide mRNA transcript which is translated into a 154 amino acid protein using a non-AUG (CUG) start codon. The gene has been shown to be largely expressed in the testis, tibial nerve, and the pituitary gland, although expression has been seen throughout a majority of tissue types.[2] [3] [4] The function of C16orf82 is not fully understood by the scientific community.[5]
Gene
Locus
C16orf82 is located in humans at locus 16p12.1 on the positive strand.
General features
The gene encodes for a 2285 nucleotide mRNA transcript that is intronless. Human intronless genes represent a unique subset of the genome that are often involved in signaling, sperm formation, immune responses, or development.[6] C16orf82 being such a gene indicates it may play a role in one of these processes. Translation of C16orf82 initiates at a non-AUG (CUG) start codon. The presence of the non-canonical start codon suggests possible increased regulation of C16orf82 translation and/or possibly could allow for the translation of protein products that start with leucine instead of methionine as seen in proteins coded for by some genes present in the major histocompatibility complex.[7] [8]
DNA level regulation
Promoter
The C16orf82 promoter region has been predicted to contain a number of transcription factor binding sites including binding sites for transcription factors within the SOX family.[9] The presence of the SOX family transcription binding sites suggests that C16orf82 may play a role in sex determination.[10] Actual transcription factor functional studies show binding of the C16ORF82 promoter by ARNT, ELF5, SMAD4, and STAT3.[11]
Expression
C16orf82 expression in humans has been observed in major organ systems including the heart, liver, brain, and kidney at a constant level.[12] The tissue in which C16orf82 has been seen to be most highly expressed has been the testis, both by microarray experiments as well as RNA-seq. C16orf82 expression is also highly variable between individuals, with some expressing the gene in large amounts while others barely express the gene within the same tissue type.[13] Micro RNA (miR-483) over expression has been shown to knock down C16orf82 expression.[14]
Protein
General features
The C16orf82 protein is 154 amino acids in length with an approximate molecular weight of 16.46 kDa with a predicted isoelectric point of 6.06.[15] There are no known variants or isoforms of C16orf82.
Domains
C16orf82 contains one domain, DUF4694, which currently has a function that is uncharacterized. The domain spans from amino acid 8 to amino acid 153.[16] DUF4694 contains a SSGY (serine-serine-glycine-tyrosine) sequence motif that is found in a majority of the protein's orthologs.[17] [18] There is no presence of a transmembrane domain thus the protein is not a transmembrane protein.
Cellular localization
The localization of C16orf82 within a cell has been predicted to be nuclear.[19] A bipartite nuclear localization signal can be found starting at Arg107.
Post-translational modifications
The human C16orf82 protein has been predicted to be phosphorylated at a number of serine residues.[20] O-linked glycosylation has also been predicted to happen at a number of sites, including some that overlap with the aforementioned phosphorylation sites.[21] The sites of overlap between the two types of post-translational modifications could play important regulatory roles in the activity and lifespan of the human C16orf82 protein.[22]
Secondary structure
The secondary structure of the human C16orf82 protein has been predicted to be largely disordered by a number of modeling programs.[23] [24] [25] [26]
Evolution/homology
Paralogs
No paralogs of C16orf82 exist within humans.
Orthologs
C16orf82 has over 100 predicted orthologs, which all reside in the class mammalia and more precisely the subclass eutheria.[27] All of the orthologs contained the domain DUF4964. The most distant ortholog detected was within the nine-banded armadillo (Dasypus novemcinctus) within the order Cingluata. Below is a table of 20 orthologs from various orders within the subclass eutheria with the sequence identity and time since divergence in relation to humans.
Rate of evolution
C16orf82's rate of evolution was determined to be relatively fast even in comparison to fibrinogen, a gene that has been shown to evolve quickly.[30]
Clinical significance
Behavioral disorders
C16orf82 has been associated with Schizophrenia through a genome-wide association study and autism based on copy number variation analysis.[31] [32] Currently, research has not shown if C16orf82 plays any direct role in either of these disorders.
References
- Web site: C16orf82 chromosome 16 open reading frame 82 [Homo sapiens (human)] - Gene - NCBI]. www.ncbi.nlm.nih.gov. 2018-02-05.
- Sato T, Kaneda A, Tsuji S, Isagawa T, Yamamoto S, Fujita T, Yamanaka R, Tanaka Y, Nukiwa T, Marquez VE, Ishikawa Y, Ichinose M, Aburatani H . PRC2 overexpression and PRC2-target gene repression relating to poorer prognosis in small cell lung cancer . Scientific Reports . 3 . 1911 . 2013-05-29 . 23714854 . 3665955 . 10.1038/srep01911 . 2013NatSR...3E1911S .
- Ardlie KG, Deluca DS, Segrè AV, Sullivan TJ, Young TR, Gelfand ET, etal . GTEx Consortium . Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans . Science . 348 . 6235 . 648–60 . May 2015 . 25954001 . 4547484 . 10.1126/science.1262110 .
- Jelinsky SA, Rodeo SA, Li J, Gulotta LV, Archambault JM, Seeherman HJ . Regulation of gene expression in human tendinopathy . BMC Musculoskeletal Disorders . 12 . 86 . May 2011 . 21539748 . 3095578 . 10.1186/1471-2474-12-86 . free .
- Web site: C16orf82 Gene - GeneCards TNT Protein TNT Antibody. Database. GeneCards Human Gene. www.genecards.org. 2018-02-19.
- Grzybowska EA . Human intronless genes: functional groups, associated diseases, evolution, and mRNA processing in absence of splicing . Biochemical and Biophysical Research Communications . 424 . 1 . 1–6 . July 2012 . 22732409 . 10.1016/j.bbrc.2012.06.092 .
- Glass NL . Near-Cognate Codons Contribute Complexity to Translation Regulation . mBio . 8 . 6 . e01820–17 . November 2017 . 29114030 . 5676045 . 10.1128/mbio.01820-17 .
- Starck SR, Jiang V, Pavon-Eternod M, Prasad S, McCarthy B, Pan T, Shastri N . Leucine-tRNA initiates at CUG start codons for protein synthesis and presentation by MHC class I . Science . 336 . 6089 . 1719–23 . June 2012 . 22745432 . 10.1126/science.1220270 . 2012Sci...336.1719S . 206540614 .
- Web site: Genomatix: Login Page. www.genomatix.de. en-US. 2018-04-22.
- Barrionuevo F, Scherer G . SOX E genes: SOX9 and SOX8 in mammalian testis development . The International Journal of Biochemistry & Cell Biology . 42 . 3 . 433–6 . March 2010 . 19647095 . 10.1016/j.biocel.2009.07.015 .
- Lachmann A, Xu H, Krishnan J, Berger SI, Mazloom AR, Ma'ayan A . ChEA: transcription factor regulation inferred from integrating genome-wide ChIP-X experiments . Bioinformatics . 26 . 19 . 2438–44 . October 2010 . 20709693 . 2944209 . 10.1093/bioinformatics/btq466 .
- Yanai I, Benjamin H, Shmoish M, Chalifa-Caspi V, Shklar M, Ophir R, Bar-Even A, Horn-Saban S, Safran M, Domany E, Lancet D, Shmueli O . Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification . Bioinformatics . 21 . 5 . 650–9 . March 2005 . 15388519 . 10.1093/bioinformatics/bti042 . free .
- Goh SH, Josleyn M, Lee YT, Danner RL, Gherman RB, Cam MC, Miller JL . The human reticulocyte transcriptome . Physiological Genomics . 30 . 2 . 172–8 . July 2007 . 17405831 . 10.1152/physiolgenomics.00247.2006 .
- Liu M, Roth A, Yu M, Morris R, Bersani F, Rivera MN, Lu J, Shioda T, Vasudevan S, Ramaswamy S, Maheswaran S, Diederichs S, Haber DA . The IGF2 intronic miR-483 selectively enhances transcription from IGF2 fetal promoters and enhances tumorigenesis . Genes & Development . 27 . 23 . 2543–8 . December 2013 . 24298054 . 3861668 . 10.1101/gad.224170.113 .
- Book: 2005. 978-1-58829-343-5. 571–607. en-gb. 10.1385/1592598900. Walker. John M.. The Proteomics Protocols Handbook . 43080491. John M. Walker.
- Web site: protein TNT [Homo sapiens] - Protein - NCBI]. www.ncbi.nlm.nih.gov. 2018-02-19.
- Web site: NCBI CDD Conserved Protein Domain DUF4694. group. NIH/NLM/NCBI/IEB/CDD. www.ncbi.nlm.nih.gov. en. 2018-02-19.
- Web site: Protein BLAST: search protein databases using a protein query. blast.ncbi.nlm.nih.gov. en. 2018-02-19.
- Nakai K, Horton P . PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization . Trends in Biochemical Sciences . 24 . 1 . 34–6 . January 1999 . 10087920 . 10.1016/S0968-0004(98)01336-X .
- Blom N, Gammeltoft S, Brunak S . Sequence and structure-based prediction of eukaryotic protein phosphorylation sites . Journal of Molecular Biology . 294 . 5 . 1351–62 . December 1999 . 10600390 . 10.1006/jmbi.1999.3310 .
- Steentoft C, Vakhrushev SY, Joshi HJ, Kong Y, Vester-Christensen MB, Schjoldager KT, Lavrsen K, Dabelsteen S, Pedersen NB, Marcos-Silva L, Gupta R, Bennett EP, Mandel U, Brunak S, Wandall HH, Levery SB, Clausen H . Precision mapping of the human O-GalNAc glycoproteome through SimpleCell technology . The EMBO Journal . 32 . 10 . 1478–88 . May 2013 . 23584533 . 3655468 . 10.1038/emboj.2013.79 .
- Funakoshi Y, Suzuki T . Glycobiology in the cytosol: the bitter side of a sweet world . Biochimica et Biophysica Acta (BBA) - General Subjects . 1790 . 2 . 81–94 . February 2009 . 18952151 . 10.1016/j.bbagen.2008.09.009 . free .
- Book: Yaoqi . Zhou . Andrzej . Kloczkowski . Eshel . Faraggi . Yuedong . Yang . vanc . Prediction of protein secondary structure . 9781493964048. New York, NY. 961911230 . 2016-10-28.
- Drozdetskiy A, Cole C, Procter J, Barton GJ . JPred4: a protein secondary structure prediction server . Nucleic Acids Research . 43 . W1 . W389-94 . July 2015 . 25883141 . 4489285 . 10.1093/nar/gkv332 .
- Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJ . The Phyre2 web portal for protein modeling, prediction and analysis . En . Nature Protocols . 10 . 6 . 845–58 . June 2015 . 25950237 . 5298202 . 10.1038/nprot.2015.053 .
- Biasini M, Bienert S, Waterhouse A, Arnold K, Studer G, Schmidt T, Kiefer F, Gallo Cassarino T, Bertoni M, Bordoli L, Schwede T . SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information . Nucleic Acids Research . 42 . Web Server issue . W252-8 . July 2014 . 24782522 . 4086089 . 10.1093/nar/gku340 .
- Web site: ortholog_gene_162083[group] - Gene - NCBI]. www.ncbi.nlm.nih.gov. 2018-02-19.
- Kumar S, Stecher G, Suleski M, Hedges SB . TimeTree: A Resource for Timelines, Timetrees, and Divergence Times . Molecular Biology and Evolution . 34 . 7 . 1812–1819 . July 2017 . 28387841 . 10.1093/molbev/msx116 . free .
- Web site: Home - Protein - NCBI. www.ncbi.nlm.nih.gov. 2018-04-23.
- Zhang J, Yang JR . Determinants of the rate of protein sequence evolution . En . Nature Reviews. Genetics . 16 . 7 . 409–20 . July 2015 . 26055156 . 4523088 . 10.1038/nrg3950 .
- McCarthy MJ, Nievergelt CM, Kelsoe JR, Welsh DK . A survey of genomic studies supports association of circadian clock genes with bipolar disorder spectrum illnesses and lithium response . PLOS ONE . 7 . 2 . e32091 . 2012-02-22 . 22384149 . 3285204 . 10.1371/journal.pone.0032091 . 2012PLoSO...732091M . free .
- Wang LS, Hranilovic D, Wang K, Lindquist IE, Yurcaba L, Petkovic ZB, Gidaya N, Jernej B, Hakonarson H, Bucan M . Population-based study of genetic variation in individuals with autism spectrum disorders from Croatia . BMC Medical Genetics . 11 . 134 . September 2010 . 20858243 . 2954843 . 10.1186/1471-2350-11-134 . free .