Trinucleotide repeat containing 18

Trinucleotide repeat containing 18 is a protein that in humans is encoded by the TNRC18 gene.[1]

Function

The exact function of TNRC18 is not yet well understood. The protein sequence provided by the National Center for Biotechnology Information (NCBI) database includes a Bromo Adjacent Homology (BAH) domain within TNRC18.[2] BAH domains are often found in chromatin-associated proteins and assist in the silencing of genes.[3]

Gene

According to the UCSC Genome Browser, TNRC18 is located on chromosome 7 in humans (chr7: 5,306,800-5,423,546). There are 29 introns and 30 exons listed. TNRC18 is directly preceded by SLC29A4 and immediately followed by AC092171.4.[4] SLC29A4 encodes the plasma membrane monoamine transporter in humans.
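As a quick arithmetic check, the length of the genomic interval quoted above can be computed from its endpoints. This is only a sketch: it assumes the coordinates are 1-based and inclusive, and the function name `interval_length` is illustrative rather than part of any tool. It also checks that the listed intron count is one less than the exon count, as expected for a linear transcript.

```python
def interval_length(start: int, end: int) -> int:
    """Length in base pairs of a 1-based, inclusive genomic interval."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1

# Coordinates quoted in the text for TNRC18 (chr7, hg38)
tnrc18_start, tnrc18_end = 5_306_800, 5_423_546
span_bp = interval_length(tnrc18_start, tnrc18_end)

# Exon and intron counts quoted in the text
n_exons, n_introns = 30, 29
counts_consistent = n_introns == n_exons - 1

print(f"TNRC18 locus spans {span_bp:,} bp (about {span_bp / 1000:.1f} kb)")
print("Intron count consistent with exon count:", counts_consistent)
```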

GeneCards lists five aliases for TNRC18: Long CAG Trinucleotide Repeat-Containing Gene 79 Protein, Trinucleotide Repeat-Containing Gene 18 Protein, CAGL79, KIAA1856, and TNRC18A. Additionally, TNRC18 has two paralogs, BAH Domain And Coiled-Coil Containing 1 (BAHCC1) and Bromo Adjacent Homology Domain Containing 1 (BAHD1).[5]

mRNA and Isoforms

The NCBI gene page for TNRC18 lists 9 different protein isoforms across 12 transcript variant mRNA sequences.[6] TNRC18 isoform X7 is encoded by mRNA transcript variants X7-X10. Additionally, isoforms X8 and X9 are produced by variants X11 and X12 respectively.
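The relationship between transcript variants and isoforms described above can be written out as a small lookup table. This is a sketch under one labeled assumption: the text only states the mapping for variants X7 through X12, so the one-to-one mapping of variants X1 to X6 onto isoforms X1 to X6 is inferred here from the totals (12 variants, 9 isoforms) and is not confirmed by the source.

```python
# Transcript variant -> protein isoform.
# ASSUMPTION: variants X1-X6 each encode the like-numbered isoform; the
# text only states the mapping for variants X7-X12.
variant_to_isoform = {f"X{i}": f"X{i}" for i in range(1, 7)}

# Stated in the text: variants X7-X10 all encode isoform X7
variant_to_isoform.update({f"X{i}": "X7" for i in range(7, 11)})

# Stated in the text: variants X11 and X12 encode isoforms X8 and X9
variant_to_isoform.update({"X11": "X8", "X12": "X9"})

# Distinct isoforms, sorted numerically by the digits after "X"
isoforms = sorted(set(variant_to_isoform.values()), key=lambda s: int(s[1:]))

print(len(variant_to_isoform), "transcript variants ->", len(isoforms), "isoforms")
```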

Protein

The protein sequence provided by NCBI gives human TNRC18 a length of 2968 amino acids. The ExPASy Compute pI/Mw tool[7] predicts an isoelectric point of 8.88 and a molecular weight of 315 kDa for TNRC18. Additionally, the NCBI protein record annotates nine phosphorylation sites: eight phosphoserines and one phosphothreonine. A large serine repeat spans amino acid positions 2604–2670, upstream of the BAH domain, which is located at positions 2816–2960.
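The figures in this paragraph lend themselves to a few sanity checks. The sketch below derives the average mass per residue implied by the predicted molecular weight and the sizes of the two annotated regions. The value of roughly 110 Da per residue is a common rule of thumb brought in here as an assumption, not a number from the source, and the helper `region_length` is illustrative.

```python
def region_length(first: int, last: int) -> int:
    """Number of residues in a 1-based, inclusive region."""
    return last - first + 1

LENGTH_AA = 2968            # protein length from the NCBI record
PREDICTED_MW_DA = 315_000   # 315 kDa, predicted by Compute pI/Mw
SERINE_REPEAT = (2604, 2670)
BAH_DOMAIN = (2816, 2960)

# Average residue mass implied by the predicted molecular weight
avg_residue_da = PREDICTED_MW_DA / LENGTH_AA

# ASSUMPTION: ~110 Da per residue is only a rough rule of thumb
rule_of_thumb_kda = LENGTH_AA * 110 / 1000

# Residues lying strictly between the serine repeat and the BAH domain
gap = BAH_DOMAIN[0] - SERINE_REPEAT[1] - 1

print(f"Implied average residue mass: {avg_residue_da:.1f} Da")
print(f"Rule-of-thumb mass estimate: {rule_of_thumb_kda:.0f} kDa")
print(f"Serine repeat length: {region_length(*SERINE_REPEAT)} aa")
print(f"BAH domain length: {region_length(*BAH_DOMAIN)} aa")
print(f"Residues between the two regions: {gap}")
```

The rule-of-thumb estimate lands within a few percent of the predicted 315 kDa, which suggests the quoted length and molecular weight are mutually consistent.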

The predicted secondary structure for TNRC18 consists of 32.61% alpha helix, 6.74% extended strand, and 60.55% random coil. This was found using the GOR4 program available at PRABI-Lyon-Gerland with the NCBI protein sequence for TNRC18.[8] [9]

Expression

RNA sequencing of human tissue samples found ubiquitous TNRC18 expression. The most prominent expression was observed in colon, kidney, and prostate tissue samples. In fetal human tissue samples, notable expression was found in the stomach, lung, and brain. RNA sequencing data was acquired through the TNRC18 gene expression page found on NCBI.[10]

The Human Protein Atlas shows the highest RNA expression of TNRC18 in the brain, endocrine tissue, and muscle tissue. Additionally, the highest protein expression is observed in the brain, endocrine tissue, lung, gastrointestinal tract, and male- and female-specific tissues. Conversely, no protein expression is observed in the eye or blood tissue, despite ubiquitous RNA expression of TNRC18.[11]

In mouse brain, noteworthy TNRC18 expression is observed in the olfactory bulb, isocortex, and cerebellar cortex. This brain atlas information is provided by the Allen Institute Brain Atlas.[12]

Homology

An NCBI Protein BLAST search for reference proteins lists the following orthologs for human TNRC18. The table is ordered first by increasing estimated date of divergence from humans in millions of years (MYA) and then by highest-to-lowest sequence identity with humans. Date of divergence information was acquired from TimeTree[13] and sequence identity and similarity percentages were found by a pairwise sequence alignment using the European Bioinformatics Institute (EMBL-EBI) EMBOSS Needle program.[14]

Orthologs for Homo sapiens TNRC18 gene

| Species | Common name | NCBI Protein Accession | Date of Divergence (MYA) | Sequence Identity with Humans (%) | Sequence Similarity with Humans (%) | Length (aa) |
|---|---|---|---|---|---|---|
| Homo sapiens | Human | NP_001073964 | 0 | 100 | 100 | 2968 |
| Pongo abelii | Sumatran orangutan | XP_024097037 | 15.76 | 98.6 | 98.9 | 2964 |
| Nomascus leucogenys | Northern white-cheeked gibbon | XP_030652734 | 19.8 | 98.4 | 98.8 | 2965 |
| Ictidomys tridecemlineatus | Thirteen-lined ground squirrel | XP_021575654 | 90 | 85.3 | 88.2 | 2924 |
| Rattus norvegicus | Brown rat | NP_001100593 | 90 | 81.4 | 85.6 | 2900 |
| Acinonyx jubatus | Cheetah | XP_026898111 | 96 | 87.5 | 89.9 | 2972 |
| Orcinus orca | Killer whale | XP_012388176 | 96 | 87.3 | 89.8 | 2967 |
| Lynx canadensis | Canada lynx | XP_030156810 | 96 | 87.3 | 89.8 | 2966 |
| Enhydra lutris kenyoni | Sea otter | XP_022356606 | 96 | 86.8 | 89.6 | 2999 |
| Odocoileus virginianus texanus | White-tailed deer | XP_020730376 | 96 | 86.2 | 89.1 | 2939 |
| Ursus arctos horribilis | Grizzly bear | XP_026371323 | 96 | 82.1 | 85 | 3111 |
| Haliaeetus leucocephalus | Bald eagle | XP_010568620 | 312 | 58.7 | 69.4 | 2928 |
| Apteryx rowi | Okarito kiwi | XP_025939070 | 312 | 58.7 | 69.3 | 2932 |
| Pogona vitticeps | Central bearded dragon | XP_020658091 | 312 | 54.7 | 65.7 | 2943 |
| Python bivittatus | Burmese python | XP_015744874 | 312 | 53.6 | 65 | 2872 |
| Perca flavescens | Yellow perch | XP_028454308 | 435 | 39.1 | 50.5 | 3044 |
| Amphiprion ocellaris | Ocellaris clownfish | XP_023150301 | 435 | 39 | 50 | 3071 |
| Branchiostoma belcheri | Lancelet | XP_019646059 | 684 | 25.6 | 36.5 | 2799 |
| Saccoglossus kowalevskii | Acorn worm | XP_002738509 | 684 | 23.6 | 34.8 | 3174 |
| Crassostrea virginica | Eastern oyster | XP_022319473 | 797 | 21 | 31.6 | 2200 |
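The ordering rule stated for this table (increasing divergence date, then decreasing sequence identity) can be checked mechanically. The sketch below copies the divergence and identity columns from the table and compares the table order against a sorted copy. It also groups the rows by divergence date to summarize how identity falls with evolutionary distance. The variable names are illustrative.

```python
from collections import defaultdict

# (common name, divergence from humans in MYA, % identity), in table order
rows = [
    ("Human", 0, 100.0),
    ("Sumatran orangutan", 15.76, 98.6),
    ("Northern white-cheeked gibbon", 19.8, 98.4),
    ("Thirteen-lined ground squirrel", 90, 85.3),
    ("Brown rat", 90, 81.4),
    ("Cheetah", 96, 87.5),
    ("Killer whale", 96, 87.3),
    ("Canada lynx", 96, 87.3),
    ("Sea otter", 96, 86.8),
    ("White-tailed deer", 96, 86.2),
    ("Grizzly bear", 96, 82.1),
    ("Bald eagle", 312, 58.7),
    ("Okarito kiwi", 312, 58.7),
    ("Central bearded dragon", 312, 54.7),
    ("Burmese python", 312, 53.6),
    ("Yellow perch", 435, 39.1),
    ("Ocellaris clownfish", 435, 39.0),
    ("Lancelet", 684, 25.6),
    ("Acorn worm", 684, 23.6),
    ("Eastern oyster", 797, 21.0),
]

# Python's sort is stable, so rows with equal keys keep their table order
expected = sorted(rows, key=lambda r: (r[1], -r[2]))
is_ordered = rows == expected
print("Table follows the stated ordering:", is_ordered)

# Mean identity for each divergence date
by_divergence = defaultdict(list)
for name, mya, identity in rows:
    by_divergence[mya].append(identity)

for mya, values in by_divergence.items():
    mean_identity = sum(values) / len(values)
    print(f"{mya:>6} MYA: mean identity {mean_identity:.1f}% (n={len(values)})")
```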

Predicted post-translational modifications and motifs

The following post-translational modifications and motifs are predicted for TNRC18 using programs found on the ExPASy Proteomics page.[15] The exception is the GPS-MSP methylation program, which is found on The Cuckoo Workgroup site.[16] This list is not an exhaustive account of the post-translational modifications or motifs associated with TNRC18 and is based solely on software predictions.

Of the predicted post-translational modifications, there are 92 O-linked β-N-acetylglucosamine (O-β-GlcNAc) sites with a high scoring threshold (>=0.5), 23 sumoylation sites, two palmitoylation sites, one methylation site, and 52 glycation sites. Additionally, GPS 5.0 predicted 22,317 phosphorylation sites on TNRC18. The program was used to confirm the nine phosphorylation sites found on the NCBI protein page for TNRC18.

Predicted post-translational modifications and motifs for TNRC18

| Program | Predicted post-translational modification | Amino acid location on protein |
|---|---|---|
| YinOYang | O-β-GlcNAc | 80, 185, 199, 352, 416, 626, 627, 640, 788, 991, 995, 1033, 1038, 1533, 1753, 1956, 2023, 2368, 2404, 2510, 2557, 2559–2573, 2611, 2614–2667, 2721, 2892 |
| GPS-SUMO (SUMOsp) | Sumoylation | 238–242, 467, 620, 652, 858–862, 1159, 1258, 1461, 1544, 1629, 1638, 1704, 1743, 1885–1889, 1893, 1898, 1899, 2098, 2213–2217, 2259–2263, 2463, 2542, 2964–2968 |
| CSS-Palm | Palmitoylation | 284, 1196 |
| GPS-MSP | Methylation | 2332 |
| GPS 5.0 | Phosphorylation | 263, 611, 1127, 1136, 1540, 1857, 1863, 2146, 2771 |
| NetGlycate | Glycation | 156, 197, 270, 272, 429, 492, 548, 609, 652, 692, 749, 755, 938, 988, 1058, 1059, 1131, 1370, 1461, 1470, 1503, 1554, 1558, 1577, 1615, 1618, 1791, 1797, 1893, 1895, 1898, 1899, 1933, 1967, 1978, 2028, 2091, 2301, 2315, 2328, 2388, 2438, 2475, 2519, 2702, 2709, 2720, 2750, 2801, 2816, 2857, 2869 |
| Eukaryotic Linear Motif (ELM) | Coiled-coil region | 916–949, 1481–1516 |
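One way to relate these predictions to the annotated regions of the protein is to parse the location lists and test each entry for overlap with the serine repeat (2604–2670) and the BAH domain (2816–2960) given in the Protein section. The sketch below does this for the O-β-GlcNAc and glycation rows. The helpers `parse_locations` and `overlapping` are hypothetical names introduced for illustration, and a range such as 2559–2573 is treated as a single entry.

```python
import re

def parse_locations(text):
    """Turn '284, 2559–2573' into [(284, 284), (2559, 2573)]."""
    spans = []
    for token in text.split(","):
        # Accept either a hyphen or an en dash inside a range
        numbers = [int(n) for n in re.split(r"[-–]", token.strip())]
        spans.append((numbers[0], numbers[-1]))
    return spans

def overlapping(spans, region):
    """Return the spans sharing at least one residue with region."""
    first, last = region
    return [s for s in spans if s[0] <= last and first <= s[1]]

# Regions quoted in the Protein section
SERINE_REPEAT = (2604, 2670)
BAH_DOMAIN = (2816, 2960)

# Location strings copied from the table
o_glcnac = parse_locations(
    "80, 185, 199, 352, 416, 626, 627, 640, 788, 991, 995, 1033, 1038, "
    "1533, 1753, 1956, 2023, 2368, 2404, 2510, 2557, 2559–2573, 2611, "
    "2614–2667, 2721, 2892"
)
glycation = parse_locations(
    "156, 197, 270, 272, 429, 492, 548, 609, 652, 692, 749, 755, 938, "
    "988, 1058, 1059, 1131, 1370, 1461, 1470, 1503, 1554, 1558, 1577, "
    "1615, 1618, 1791, 1797, 1893, 1895, 1898, 1899, 1933, 1967, 1978, "
    "2028, 2091, 2301, 2315, 2328, 2388, 2438, 2475, 2519, 2702, 2709, "
    "2720, 2750, 2801, 2816, 2857, 2869"
)

print("Glycation entries:", len(glycation))
print("O-GlcNAc in serine repeat:", overlapping(o_glcnac, SERINE_REPEAT))
print("O-GlcNAc in BAH domain:", overlapping(o_glcnac, BAH_DOMAIN))
print("Glycation in BAH domain:", overlapping(glycation, BAH_DOMAIN))
```

The same two helpers can be applied to any other row of the table by passing its location string to `parse_locations`.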

Clinical significance

Shen et al. observed that circTNRC18 inhibits miR-762 activity in pre-eclampsia (PE) placenta tissue samples.[17] The inhibition of miR-762 by circTNRC18 resulted in elevated Grhl2 protein levels. PE placenta samples had lower miR-762 levels and higher Grhl2 levels, which were attributed to overexpression of circTNRC18. Shen et al. conclude that circTNRC18 is upregulated in PE placentas compared with normal pregnancy placentas.

Chu et al. found that, of 19 CpG sites linked with estimated glomerular filtration rate (eGFR), 5 were also linked with renal fibrosis and DNA methylation changes in the kidney cortex of chronic kidney disease (CKD) patients.[18] Chu et al. note that reduced eGFR is a defining feature of CKD. These 5 CpG sites were found at the genes TNRC18, PTPN6/PHB2, ANKRD11, PQLC2, and PRPF8. Chu et al. conclude that epigenetic variation may be associated with CKD.

Notes and References

  1. Web site: Entrez Gene: Trinucleotide repeat containing 18 . 2016-10-22 .
  2. Web site: trinucleotide repeat-containing gene 18 protein [Homo sapiens] - Protein - NCBI. www.ncbi.nlm.nih.gov. 2020-05-01.
  3. Web site: InterPro. www.ebi.ac.uk. 2020-05-01.
  4. Web site: Human hg38 chr7:5,306,800-5,423,546 UCSC Genome Browser v397. genome.ucsc.edu. 2020-05-03.
  5. Web site: TNRC18 Gene - GeneCards TNC18 Protein TNC18 Antibody. www.genecards.org. 2020-05-01.
  6. Web site: TNRC18 trinucleotide repeat containing 18 [Homo sapiens (human)] - Gene - NCBI. www.ncbi.nlm.nih.gov. 2020-05-01.
  7. Web site: ExPASy - Compute pI/Mw tool. web.expasy.org. 2020-05-01.
  8. Combet C., Blanchet C., Geourjon C. and Deléage G. NPS@: Network Protein Sequence Analysis. TIBS 2000 March Vol. 25, No 3 [291]:147-150.
  9. Web site: NPS@ : GOR4 secondary structure prediction. npsa-prabi.ibcp.fr. 2020-05-03.
  10. Web site: TNRC18 Gene Expression - Gene - NCBI. www.ncbi.nlm.nih.gov. 2020-05-03.
  11. Web site: Tissue expression of TNRC18 - Summary - The Human Protein Atlas. www.proteinatlas.org. 2020-05-03.
  12. Web site: Interactive Atlas Viewer :: Atlas Viewer. atlas.brain-map.org. 2020-05-03.
  13. Web site: TimeTree :: The Timescale of Life. www.timetree.org. 2020-05-03.
  14. Web site: EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI. www.ebi.ac.uk. 2020-05-03.
  15. Web site: ExPASy: SIB Bioinformatics Resource Portal - Categories. www.expasy.org. 2020-05-03.
  16. Web site: GPS-MSP - Methyl-group Specific Predictor 1.0. msp.biocuckoo.org. 2020-05-03.
  17. Shen XY, Zheng LL, Huang J, Kong HF, Chang YJ, Wang F, Xin H . CircTRNC18 inhibits trophoblast cell migration and epithelial-mesenchymal transition by regulating miR-762/Grhl2 pathway in pre-eclampsia . RNA Biology . 16 . 11 . 1565–1573 . November 2019 . 31354028 . 6779405 . 10.1080/15476286.2019.1644591 .
  18. Chu AY, Tin A, Schlosser P, Ko YA, Qiu C, Yao C, Joehanes R, Grams ME, Liang L, Gluck CA, Liu C, Coresh J, Hwang SJ, Levy D, Boerwinkle E, Pankow JS, Yang Q, Fornage M, Fox CS, Susztak K, Köttgen A . Epigenome-wide association studies identify DNA methylation associated with kidney function . Nature Communications . 8 . 1 . 1286 . November 2017 . 29097680 . 5668367 . 10.1038/s41467-017-01297-7 .