TSBP1 explained

TSBP1 is a protein that in humans is encoded by the TSBP1 gene.[1] [2] TSBP1 was previously known as C6orf10. C6orf10 is an open reading frame on chromosome 6 containing a protein that is ubiquitously expressed at low levels in the adult genome and may play a role during fetal development. C6orf10 has been found to be linked to both neurodegenerative and autoimmune diseases in adults. Expression of this gene is highest in the testis but is also seen in other tissue types such as the brain, lens of the eye and the medulla.[3] [4] [5]

mRNA Transcript

C6orf10 contains seven human mRNA splice variants (a, b, c, X1, X2, X3, X4).

Conserved non-coding region

TSBP1 contains a highly conserved stem loop structure in the 3' UTR from bases 100–124.

Protein

Composition

C6orf10 isoform a is rich in lysine (K), Glutamine (Q) and Glutamic acid (E) and poor in Histidine (H) and Phenylalanine(F)[6] ]. Isoform a is a basic protein with an isoelectric point of 9 and a molecular weight of 62,000 kDa.[7]

This isoform contains two transmembrane regions near the beginning of the amino acid sequence.[8] The first transmembrane region spans from residue 6 to residue 25 (19 total residues) and has an isoelectric point of 5. The second transmembrane region spans from residue 100 to residue 119 (19 total residues) and has an isoelectric point of 8. Isoform a contains a PTZ00121 domain[9] starting with residue 221 and going until the end of the protein. There are several repetitive sequences within this domain.

Secondary Structure

TSBP1 consists mostly of alpha helices and random coils.[10] There are only a few regions that contain beta sheets.[11] [12]

Subcellular Localization

TSBP1 is predicted to be localized to the Nucleus and the Endoplasmic Reticulum. There is a signal peptide cleavage site between amino acid 30 and 31[13] which includes the first transmembrane domain. This N-terminal region of C6orf10 is likely localized to the endoplasmic reticulum. The C-terminal region of the protein contain two nuclear localization signals from amino acid 489-505 and 513-529 indicating that the section of the protein after the signal peptide cleavage site is localized to the nucleus.

Expression

TSBP1 is ubiquitously expressed at low levels in the adult human genome. In adults, expression of this gene is highest in the testis.[14] C6orf10 is expressed at higher levels in fetal and embryonic tissues. This indicates C6orf10 may play a role in development.

Regulation of expression

Transcriptional

TSBP1 has a promoter that is 1206 bases long.[15] This promoter overlaps with the 3' UTR but ends before the first codon. This promoter is fairly well conserved across primates except for a 136 nucleotide region midway through and the end of the promoter region.[16] Primates have insertions at these two regions that humans are missing. This may suggest that these regions of the promoter are not essential to humans.

Transcription factors

TSBP1 transcription is regulated by the binding of many transcription factors to the promoter region. The CCAAT binding protein and TATA box are highly conserved regions that are important in the initiation of transcription. Several of the transcription factors including EH1, NACA, NKX5-2, SIX4, VCR, etc. are involved in developmental pathways.[15]

AbbreviationTranscription Factor Full NameMatrix scoreStrand
CSRNP-1Cytosine-Serine rich nuclear protein 1(AXUD1, AXIN1 up-regulated-1)1.0+|-|CCAAT Box|CCAAT/enhancer binding protein (C/EBP), gamma|0.923|+|-|EH1|Engrailed Homeobox 1|0.862|+|-|Cart-1|Cartilage homeoprotein 1|0.997|-|-|ZFP 263|Zinc finger protein 263, ZKSCAN12 (Zinc finger protein with KRAB and SCAN domains 12)|0.921|+
SWI/SNFSWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily, a member 30.999-|-|TATA Box|Vertebrate TATA binding protein factor|0.899|+
HSF2Heat shock Factor 20.974-|-|Hmx2/Nkx5-2|Hmx2/Nkx5-2 homeodomain transcription factor|0.933|+|-|Pdx1|Insulin promoter factor 1; pancreatic and duodenal homeobox 1 (Pdx1)|0.924|+|-|LMX1A|LIM homeobox transcription factor 1 alpha|1.0|+|-|NACA|Nascent polypeptide-associated complex subunit alpha 1|1.0|-|-|Oct1|Octamer binding factor 1|0.921|+|-|POU6F1|POU class 6 homeobox 1 |0.973|+|-|STAT 5B|Signal transducer and activator of transcription 5B|0.973|+
SIX 4Sine oculis homeobox homolog 40.96-|-|NMP4|Nuclear matrix protein 4|0.971|+|-|MSX|Homeodomain proteins MSX-1 and MSX-2|0.989|-|-|AREB6|Atp1a1 regulatory element binding factor 6|1.0|-
VCRVertebrate caudal related homeodomain protein0.963+|}

Protein Interactions

Most of the predicted protein interactions with C6orf10 are based solely on text mining and information gathered from genome-wide association studies. The two proteins with the highest interaction scores were Butyrophilin-like protein 2 (BTNL2) and Tetratricopeptide repeat domain containing TTC32.[17] BTNL2 is a negative regulator of T-cell activity and member of the immunoglobulin superfamily. BTNL2 is located in the C6orf10 gene neighborhood. TTC32 is from a protein family of structural repeat motifs that mediate protein-protein interactions in the formation of protein-protein complexes.[18] This may indicate the potential for C6orf10 to interact with another protein for form a complex.

Clinical significance

C6orf10 has been found to be associated with both neurodegenerative diseases and autoimmune diseases. These associations are mostly obtained from genome wide association studies. Common neurodegenerative diseases associated with C6orf10 include frontotemporal dementia, Parkinson's disease,[19] and Alzheimer's disease. Autoimmune diseases associated with C6orf10 include Rheumatoid arthritis,[20] [21] [22] psoriasis,[23] multiple sclerosis,[24] Grave's disease and lupus.

Homologs

Orthologs

By searching the NCBI BLAST database[25] for protein-protein interactions, it was found that C6orf10 is a protein only found in mammals. The BLAST database found the highest number of homologs in the Primates, Artiodactyla, and Carnivora. There were only a couple of homologs in the taxonomic orders of Rodentia, Chiroptera, and Perissodactyla. In the orders of Scandentia, Eulipotyphyla, Tubulidentata and sirenia there was only one complete homolog, but a few partial sequences do exist. There were partial protein sequences in Lagomorpha, Dermoptera, and Macroscelidea and there were no orthologs in Diprotodontia, Didelphimorphia, Cetacea, Dasyuromorphia, Pilosa, Monotremata, and Proboscidea. No homologs were found outside of mammals.

C6orf10 Isoform X4 Orthologs

Latin NameCommon NameIdentityMedian Date of divergence (MYA)
PrimatesHomo sapiensHuman100%0
PrimatesGorilla gorilla gorillaGorilla83.36%8.61
PrimatesPongo abeliiSumatran Orangutan82.81%15.2
CarnivoraCanis lupus familiarisDog49.79%94
CarnivoraCanis lupus dingoAustralian Dog49.33%94
ArtiodactylaBubalus bubalisWater Buffalo49.16%94
ArtiodactylaEquus caballusHorse49.00%94
ArtiodactylaOdocoileus virginianus texanusWhite Tailed Deer44.59%94
RodentiaChinchilla lanigeraChinchilla41.99%88
RodentiaMus caroliRyuku mouse41.06%88
CarnivoraFelis catusCat40.77%94
RodentiaRattus norvegicusBrown Rat37.45%88

Paralogs

C6orf10 has one paralog that diverged about 135.6 million years ago. This paralog is called Thioredoxin domain containing protein 2 (TXNDC2).----[1] BLAST: Basic Local Alignment Search Tool. National Center for Biotechnology InformationAvailable at: https://blast.ncbi.nlm.nih.gov/Blast.cgi. Accessdate 4 March 2019.

Further reading

  • Ficarro S, Chertihin O, Westbrook VA, White F, Jayes F, Kalab P, Marto JA, Shabanowitz J, Herr JC, Hunt DF, Visconti PE . Phosphoproteome analysis of capacitated human sperm. Evidence of tyrosine phosphorylation of a kinase-anchoring protein 3 and valosin-containing protein/p97 during capacitation . The Journal of Biological Chemistry . 278 . 13 . 11579–89 . March 2003 . 12509440 . 10.1074/jbc.M202325200 . free .
  • Suzuki Y, Yoshitomo-Nakagawa K, Maruyama K, Suyama A, Sugano S . Construction and characterization of a full length-enriched and a 5'-end-enriched cDNA library . Gene . 200 . 1–2 . 149–56 . October 1997 . 9373149 . 10.1016/S0378-1119(97)00411-3 .
  • Maruyama K, Sugano S . Oligo-capping: a simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides . Gene . 138 . 1–2 . 171–4 . January 1994 . 8125298 . 10.1016/0378-1119(94)90802-8 .

Notes and References

  1. Stammers M, Rowen L, Rhodes D, Trowsdale J, Beck S . BTL-II: a polymorphic locus with homology to the butyrophilin gene family, located at the border of the major histocompatibility complex class II and class III regions in human and mouse . Immunogenetics . 51 . 4–5 . 373–82 . April 2000 . 10803852 . 10.1007/s002510050633 . 31938388 .
  2. Web site: Entrez Gene: C6orf10 chromosome 6 open reading frame 10.
  3. Web site: AceView: Gene:C6orf10, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView.. www.ncbi.nlm.nih.gov. 2019-02-25.
  4. Web site: TSBP1 testis expressed basic protein 1 [Homo sapiens (human)] - Gene - NCBI]. www.ncbi.nlm.nih.gov. 2019-02-25.
  5. Web site: Tissue expression of C6orf10 - Summary - The Human Protein Atlas. www.proteinatlas.org. 2019-02-25.
  6. Web site: SAPS < Sequence Statistics < EMBL-EBI. www.ebi.ac.uk. 2019-05-06.
  7. Web site: ExPASy - Compute pI/Mw tool. web.expasy.org. 2019-05-06.
  8. Web site: PSORT II Prediction. psort.hgc.jp. 2019-05-06.
  9. Web site: NCBI CDD CDD Conserved Protein Domain Neuromodulin_N. www.ncbi.nlm.nih.gov. 2019-05-06.
  10. Web site: Coils output. embnet.vital-it.ch. 2019-05-06.
  11. Web site: CFSSP: Chou & Fasman Secondary Structure Prediction Server. www.biogem.org. 2019-05-06.
  12. Web site: Archived . 30 May 2023 . https://web.archive.org/web/20190506022254/https://npsa-prabi.ibcp.fr/cgi-bin/secpred_sopma.pl . May 6, 2019.
  13. Web site: ProP 1.0 Server - prediction results. www.cbs.dtu.dk. 2019-05-06.
  14. Web site: TSBP1 testis expressed basic protein 1 [Homo sapiens (human)] - Gene - NCBI]. www.ncbi.nlm.nih.gov. 2019-05-06.
  15. Web site: Genomatix - NGS Data Analysis & Personalized Medicine. www.genomatix.de. 2019-05-06.
  16. Web site: Human BLAT Search. genome.ucsc.edu. 2019-05-06.
  17. Web site: STRING: functional protein association networks. string-db.org. 2019-05-06.
  18. Zeytuni N, Zarivach R . Structural and functional discussion of the tetra-trico-peptide repeat, a protein interaction module . Structure . 20 . 3 . 397–405 . March 2012 . 22404999 . 10.1016/j.str.2012.01.006 . free .
  19. Zhang M, Ferrari R, Tartaglia MC, Keith J, Surace EI, Wolf U, Sato C, Grinberg M, Liang Y, Xi Z, Dupont K, McGoldrick P, Weichert A, McKeever PM, Schneider R, McCorkindale MD, Manzoni C, Rademakers R, Graff-Radford NR, Dickson DW, Parisi JE, Boeve BF, Petersen RC, Miller BL, Seeley WW, van Swieten JC, van Rooij J, Pijnenburg Y, van der Zee J, Van Broeckhoven C, Le Ber I, Van Deerlin V, Suh E, Rohrer JD, Mead S, Graff C, Öijerstedt L, Pickering-Brown S, Rollinson S, Rossi G, Tagliavini F, Brooks WS, Dobson-Stone C, Halliday GM, Hodges JR, Piguet O, Binetti G, Benussi L, Ghidoni R, Nacmias B, Sorbi S, Bruni AC, Galimberti D, Scarpini E, Rainero I, Rubino E, Clarimon J, Lleó A, Ruiz A, Hernández I, Pastor P, Diez-Fairen M, Borroni B, Pasquier F, Deramecourt V, Lebouvier T, Perneczky R, Diehl-Schmid J, Grafman J, Huey ED, Mayeux R, Nalls MA, Hernandez D, Singleton A, Momeni P, Zeng Z, Hardy J, Robertson J, Zinman L, Rogaeva E . 6 . A C6orf10/LOC101929163 locus is associated with age of onset in C9orf72 carriers . Brain . 141 . 10 . 2895–2907 . October 2018 . 30252044 . 6158742 . 10.1093/brain/awy238 .
  20. Verma A, Basile AO, Bradford Y, Kuivaniemi H, Tromp G, Carey D, Gerhard GS, Crowe JE, Ritchie MD, Pendergrass SA . Phenome-Wide Association Study to Explore Relationships between Immune System Related Genetic Loci and Complex Traits and Diseases . PLOS ONE . 11 . 8 . e0160573 . 2016 . 27508393 . 4980020 . 10.1371/journal.pone.0160573 . 2016PLoSO..1160573V . free .
  21. Zheng W, Rao S . Knowledge-based analysis of genetic associations of rheumatoid arthritis to inform studies searching for pleiotropic genes: a literature review and network analysis . Arthritis Research & Therapy . 17 . 202 . August 2015 . 1 . 26253105 . 4529690 . 10.1186/s13075-015-0715-1 . free .
  22. Malavia TA, Chaparala S, Wood J, Chowdari K, Prasad KM, McClain L, Jegga AG, Ganapathiraju MK, Nimgaonkar VL . Generating testable hypotheses for schizophrenia and rheumatoid arthritis pathogenesis by integrating epidemiological, genomic, and protein interaction data . npj Schizophrenia . 3 . 11 . 2017 . 28560257 . 5441529 . 10.1038/s41537-017-0010-z .
  23. Feng BJ, Sun LD, Soltani-Arabshahi R, Bowcock AM, Nair RP, Stuart P, Elder JT, Schrodi SJ, Begovich AB, Abecasis GR, Zhang XJ, Callis-Duffin KP, Krueger GG, Goldgar DE . Multiple Loci within the major histocompatibility complex confer risk of psoriasis . PLOS Genetics . 5 . 8 . e1000606 . August 2009 . 19680446 . 2718700 . 10.1371/journal.pgen.1000606 . free .
  24. Lin X, Deng FY, Mo XB, Wu LF, Lei SF . Functional relevance for multiple sclerosis-associated genetic variants . Immunogenetics . 67 . 1 . 7–14 . January 2015 . 25308886 . 10.1007/s00251-014-0803-4 . 16113524 .
  25. Web site: BLAST: Basic Local Alignment Search Tool. blast.ncbi.nlm.nih.gov. 2019-05-06.