C20orf144 Explained

Chromosome 20 open reading frame 144 (c20orf144) is a human protein-encoding gene.[1] The human c20orf144 protein consists of 153 amino acids, with the first 150 amino acids being characterized as part of the Bcl-2 like protein of testis (Bclt) family (pfam 15318).[2]

Gene

The c20orf144 gene is located on the plus strand at 20q11.22 and spans 3,293 base pairs.[3] The gene contains two exons. Of the plus strand, 572 nucleotides are antisense to parts of the human genes PXMP4 and NECAB3.[4] Other gene neighbors include ACTL10 and CBFA2T2.[5]

Transcript

The encoded mRNA is 522 nucleotides in length (Accession: NM_080825) and there are no identified alternative splicings.[6] Human c20orf144 mRNA expression is enriched in the testis, specifically in the early and late spermatids.[7]

Protein

The human c20orf144 gene encodes a protein of 153 amino acids in length, and there are three disordered regions (Accession: NP_543015.1). Amino acids 1-150 are a part of the Bclt protein family which is predicted to be involved in apoptosis.[8] The molecular weight is 17.2kDa and the theoretical isoelectric point is 11.47.[9] There are 21 more lysines and arginines, which are positively charged, than there are aspartates and glutamates, which are negatively charged. The tertiary protein structure, produced by AlphaFold,[10] predicts the presence of 3 α helices, and the absence of β sheets in human c20orf144.

Cellular localization

Analysis of the localization of human c20orf144 and many mammalian orthologs predicts localization of c20orf144 in the nucleus, with 78.3% confidence for the human protein.[11]

Post translational modifications

Modification
Modification Site in Human C20orf144
N-Myristoylation[12] 2G
Protein Kinase C Phosphorylation[13] 6S
Casein Kinase 2 Phosphorylation87S
Non-Specific Phosphorylation117S
O-Glycosylation[14] 117S
Protein Kinase C Phosphorylation123S

Evolution and orthologs

The evolutionary rate of C20orf144 is comparable to the high rate of evolution of fibrinogen alpha chain, suggesting the protein is evolving quickly.

Orthologs of the c20orf144 gene in Homo sapiens are found in many mammals excluding monotremes.[15] As shown in Table 2, marsupials are the most distantly related organisms to humans in which proteins encoded by human c20orf144 gene orthologs are found, suggesting that C20orf144 first appeared approximately 160 million years ago.

Genus and Species
Common NameOrderProtein Accession #Median Date of Divergence (MYA)[16] Sequence LengthSequence Identity (%)Sequence Similarity (%)
Homo sapiensHumanNP_543015.10153100100
Macaca mulattaRhesus MonkeyPrimata XP_001105397.1 28.915386.390.8
Piliocolobus tephroscelesUgandan Red ColobusPrimata XP_023076213.128.914163.766.1
Jaculus jaculusLesser Egyptian JerboaRodentiaXP_045011648.18717646.455.8
Myodes glareolusBank VoleRodentiaXP_048287479.18719742.151.8
Mus musculusHouse MouseRodentiaNP_083581.18719741.449.8
Camelus ferusWild Bactrian CamelArtiodactylaXP_032318023.1 941745464.4
Equus caballusDomestic HorsePerissodactylaXP_023482143.19417845.756
Monodon monocerosNarwhalArtiodactylaXP_029075207.19418142.950.5
Physeter catodonSperm WhaleArtiodactylaXP_023984368.1 9414840.848.4
Prionailurus bengalensisLeopard CatXP_043458511.1941795260.9
Ursus arctosBrown BearCarnivora XP_026358671.19418451.661.4
Eumetopias jubatusSteller Sea LionCarnivora XP_027974622.19418447.358.1
Rousettus aegyptiacusEgyptian Fruit BatChiropteraXP_016017694.29417551.462.7
Rhinolophus ferrumenquinumGreater Horseshoe BatChiropteraXP_032951343.1 9419140.251.5
Pteropus vampyrusLarge Flying FoxChiropteraXP_023377960.1942094050.5
Choloepus didactylusSouthern Two-Toed SlothPilosaXP_037668100.19918847.957.4
Gracilinanus agilisAgile Gracile Mouse OpossumDidelphimorphiaXP_044517537.116016937.949.7
Dromiciops gliroidesMonito del MonteMicrobiotheriaXP_043845608.11601703750.8
Sarcophilus harrisiiTasmanian DevilDasyuromorphiaXP_031809718.116016036.450

Clinical significance

In a study of 28 breast cancer patients, missense mutations in c20orf144 were found in approximately 33% of patients, suggesting a potential role for c20orf144 in the development of breast cancer.[17] Furthermore, c20orf144 is listed in primary renal proximal tubule epithelial cells as a top candidate hit in an siRNA screen, which silences targeted genes.[18] The silencing of c20orf144 in cells exposed to Shiga toxin resulted in metabolic activity that was greater than or equal to 90% of that in a typical cell.

Notes and References

  1. Web site: C20orf144 chromosome 20 open reading frame 144 [Homo sapiens (human)] - Gene - NCBI ]. 19 September 2022.
  2. Web site: RecName: Full=Uncharacterized protein C20orf144; AltName: Full=Bcl-2-like protein from testis; Short=Bclt - Gene - NCBI. 19 September 2022.
  3. Web site: C20orf144 Gene - Chromosome 20 Open Reading Frame 144 . GeneCards . 29 September 2022.
  4. Web site: Gene C20orf144 . AceView . 29 September 2022.
  5. Web site: Entry on C20orf144 . UCSC Genome Browser . 5 December 2022.
  6. Web site: Homo sapiens chromosome 20 open reading frame 144 (C20orf144), mRNA - Gene - NCBI . 24 June 2021 . 19 September 2022.
  7. Web site: Human Protein Atlas C20orf144 entry . The Human Protein Atlas . 8 December 2022.
  8. Web site: NCBI Entry on Bclt . National Center for Biotechnology Information . 15 December 2022.
  9. Web site: Compute pI/MW . Expasy Swiss Bioinformatics Resource Portal . 15 December 2022.
  10. Web site: AlphaFold Protein Structure Database entry on Human C20orf144 . 2022-12-15 . alphafold.com.
  11. Web site: PSORT II Prediction . PSORT WWW Server . 7 December 2022.
  12. Web site: Myristoylator . Expasy Swiss Bioinformatics Resource Portal . 15 December 2022.
  13. Web site: Phosphorylation Sites in Eukaryotic Proteins . NetPhos-3.1 . DTU Health Tech . 15 December 2022.
  14. Web site: O-(beta)-GlcNAc glycosylation and Yin-Yang sites . YinOYang-1.2 . DTU Health Tech . 15 December 2022.
  15. Web site: C20orf144 Entry . Protein BLAST . 23 October 2022.
  16. Kumar S, Suleski M, Craig JM, Kasprowicz AE, Sanderford M, Li M, Stecher G, Hedges SB . TimeTree 5: An Expanded Resource for Species Divergence Times . Molecular Biology and Evolution . 39 . 8 . August 2022 . 35932227 . 9400175 . 10.1093/molbev/msac174 .
  17. Aravind Kumar M, Singh V, Naushad SM, Shanker U, Lakshmi Narasu M . Microarray-based SNP genotyping to identify genetic risk factors of triple-negative breast cancer (TNBC) in South Indian population . Molecular and Cellular Biochemistry . 442 . 1–2 . 1–10 . May 2018 . 28918577 . 10.1007/s11010-017-3187-6 . 254795241 .
  18. Web site: MacMaster K . Characterization of cellular pathways and potency of Shiga toxin on endothelial cells . University of Cincinnati . 15 December 2022.