Chromosome 19 open reading frame 18 (c19orf18) is a protein which in humans is encoded by the c19orf18 gene. The gene is exclusive to mammals and the protein is predicted to have a transmembrane domain and a coiled coil stretch.[1] This protein has a function that is not yet fully understood by the scientific community.
Aliases of this gene include MGC41906 and LOC147685. The gene is located on chromosome 19 at 19q13.43.[2] The gene spans from 58,485,905 bp to 58,469,805 bp on the minus strand and contains 6 exons and 5 introns. Transcription of this gene produces one spliced mRNA which codes for the protein c19orf18.
C19orf18 is ubiquitously expressed at moderate levels. In humans, there is higher expression in the testis, prostate, lung, liver, pancreas, uterus, heart, and other connective tissues.[3] [4]
There are no known paralogs of this gene in the human genome.[5]
The gene is exclusive to mammals. The transmembrane domain is the most conserved region among close orthologs and distant homologs. The following table presents some of the orthologs found using searches in BLAST.[6] This list does not contain all of the orthologs for c19orf18. It is meant to display the diversity of species for which orthologs are found. They are sorted by date of divergence and then protein similarity.
Species | Date of Divergence (MYA) | Accession Number | Sequence length (aa) | Identity | Similarity | |
---|---|---|---|---|---|---|
Homo sapiens (Humans) | 0 | NP_689687.1 | 215 | 100% | 100% | |
Pongo abelii (Orangutan) | 15.2 | XP_002829939.1 | 216 | 92% | 94% | |
Rhinopithecus roxellana (Golden snub-nosed Monkey) | 28.1 | XP_010385277.1 | 216 | 84% | 90% | |
Carlito syrichta (Philippine tarsier) | 66.7 | XP_008066887.1 | 217 | 70% | 81% | |
Otolemur garnettii (Galago) | 73 | XP_012663984.1 | 183 | 50% | 62% | |
Mus musculus (Mouse) | 88 | XP_017167821.1 | 183 | 46% | 63% | |
Oryctolagus cuniculus (European rabbit) | 88 | XP_008247222.1 | 242 | 49% | 62% | |
Rhinolophus sinicus (Horseshoe bat) | 94 | XP_019567114.1 | 284 | 70% | 82% | |
Vicugna pacos (Alpaca) | 94 | XP_015107013.1 | 214 | 65% | 80% | |
Canis lupus familiaris (Dog) | 94 | XP_005616108.1 | 223 | 49% | 61% | |
Bos taurus (Cow) | 94 | XP_015313970.1 | 250 | 44% | 53% | |
Ornithorhynchus anatinus (Platypus) | 169 | XP_007664656.1 | 308 | 34% | 57% |
The coding sequence contains 215 amino acids. The molecular weight of c19orf18 is 24.151 kdal and the isoelectric point for the unphosphorylated state is 9.06.[7] The protein sequence is rich in leucine and is deficient in tryptophan, cysteine, and tyrosine. There is a negative charge cluster from amino acid 149 to 172.[8]
There is a cross-program consensus between GOR4, CFSSP, and PHYRE2 that the protein structure contains mostly coiled regions and alpha helices.[9] [10] [11]
The protein sequence is predicted to contain a signal peptide (1 aa to 24 aa), an extracellular domain (25 aa to 100 aa), a transmembrane domain (101 aa to 121 aa), and a cytoplasmic domain (122 aa to 215 aa).[12]
PSORTII and CELLO predicted that the human protein would localize to the plasma membrane and part of it would be in the extracellular region.[13] [14] Immunofluorescent staining of human cell line U-2 OS shows localization to the Golgi apparatus.[15]
C19orf18 protein has been predicted to interact with several proteins listed in the table below. The interactions have been identified and verified through affinity capture-MS.[16]
Predicted interacting protein name | Score | Experimental verification | |
---|---|---|---|
Nedd4 family interacting protein 1 | 0.9165 | Affinity capture-MS | |
Activin A receptor, type IIA | 0.7829 | Affinity capture-MS | |
Syntaxin 6 | 0.9679 | Affinity capture-MS | |
Bone morphogenetic protein receptor type 1A | 0.8914 | Affinity capture-MS | |
Fibroblast growth factor receptor 2 | 0.8789 | Affinity capture-MS | |
Microfibrillar-associated protein 3 | 0.8756 | Affinity capture-MS |
The c19orf18 protein is down-regulated in pancreatic cancer[23] and contains CpG sites found to be replicated for association with epithelial ovarian cancer risk.[24] The gene also decreases in expression in teratozoospermia[25] and increases in expression in polycystic ovary syndrome.[26] The gene may also be involved in prostate cancer and various tumors