CCDC186 explained

CCDC186 is a protein that in humans is encoded by the CCDC186 gene^[1] The CCDC186 gene is also known as the CTCL-tumor associated antigen with accession number NM_018017.^[2]

Gene

Location

CCDC186 has the chromosome location of 10q25.3 and is 53,750 bases in size oriented on the minus strand. PSORTII Protein k-NN Prediction indicated that C10orf118 is 65.2% of the time nuclear, 17.4% cytosolic, 8.7% mitochondrial, 4.3% vesicles of secretory system, and 4.3% endoplasmic reticulum.^[3]

Expression

Analysis of gene expression in humans and other species indicates C10orf118 is ubiquitously expressed in all tissue types at varying developmental stages. An EST profile from NCBI displayed the greatest expression in bone marrow, kidneys, and the prostate cell lines. Breakdown by health state indicates high expression of C10orf118 in bladder carcinoma and prostate cancer.^[4]

Protein

General Properties

The protein of CCDC186 (NP_060487) is 898 amino acids in length. The predicted molecular weight is 103.7kdal and the isoelectric point is predicted to be 5.92.^[5]

Composition

A serine rich region is observed in amino acids 710-747. A compositional analysis revealed that C10orf118 is Proline (1.1%) poor and Glutamic acid (14.1%) and Lysine (12.0%) rich.

Interactions

CCDC186 protein was found to interact with proteins PLEKAH5, Ezra, GAMMAHV.ORF23, and SMAD3.^[6] ^[7]

Homology

Orthologous sequences of CCDC186 were not found to be in bacteria, archea, protist, or plants. CCDC186 has no human paralogs. The date of divergence for the orthologous sequences highly correlates with the sequences similarity in that the percent identity decreases as you go back in time. Closely related orthologs include mammals and birds and moderately related orthologs include other vertebrates such as fish, reptiles, and amphibians. Distantly related orthologous sequences are primarily observed in invertebrates.^[8]

Sequence Number	Genus and species	Common name	Date of divergence (MYA)	Accession number	Sequence length	Sequence identity	Sequence similarity	E-value
1	Homo sapiens	C10orf118	0	NM_018017	898 aa	100%	100%	0%
2	Nomascus leucogenys	Northern White-cheeked Gibbon	19.9	XP_003255542	898 aa	98%	98%	0%
3	Chinchilla lanigera	long tailed chinchilla	90.9	XP_013371303	906 aa	90%	94%	0%
4	Mus musculus	house mouse	90.9	NP_739563	917 aa	84%	91%	0%
5	Tursiops truncatus	bottlense dolphin	97.5	XP_004318425	599 aa	60%	68%	0%
6	Loxodonta africana	African Bush Elephant	105	XP_010596062	805 aa	90%	91%	0%
7	Egretta garzetta	little egret	320.5	XP_009634185	914 aa	75%	83%	0%
9	Chelonia mydas	geen sea turtle	320.5	XP_007056391	920 aa	76%	83%	0%
10	Xenopus tropicalis	western clawed frog	355.7	XP_004919435	878 aa	71%	83%	0%
11	Astyanax mexicanus	Mexican tetra	429.6	XP_007259323	987 aa	71%	82%	5.00E-09
12	Oryzias latipes	Japanese rice fish	429.6	XP_011486278	991 aa	58%	71%	0%
13	Callorhinchus milii	Australian ghostshark	482.9	XP_007903851	931 aa	65%	79%	0%
14	Strongylocentrotus purpuratus	Sea Urchin	747.8	XP_011675415	884 aa	39%	58%	7.00E-84
15	Octopus bimaculoides	California Two-spot Octopus	847	XP_014782587	1221 aa	42%	60%	3.00E-127
16	Aplysiomorpha	sea hare	847	Not found in BLAST	1384 aa	42%	63%	3.00E-145
17	Orussus abietinus	wasp	847	XP_012288999	1057 aa	36%	55%	5.00E-100
18	Atta cephalotes	leafcutter ant	847	XP_012054311	1154 aa	35%	55%	9.00E-93
19	Helobdella robusta	leech	847	ESO07814	997 aa	29%	47%	1.00E-50
20	Caenorhabditis elegans	nematode	847	NP_871700	743 aa	26%	34%	1.00E-25
21	Trichoplax adhaerens	NA	936	EDV25816	976 aa	26%	46%	2.00E-11

Motifs

Motif Information^[9]	Position(s)
N-glycosylation site	33-36, 40-43, 109-112, 867-870
cAMP- and cGMP-dependent protein kinase phosphorylation site.	238-241, 583-586, 794-797
Casein kinase II phosphorylation site.	2-5, 10-13, 31-34, 36-39, 126-129, 130-133, 135-138, 139-142, 157-160, 228-231, 241-244, 256-259, 405-408, 440-443, 459-462, 490-493, 515-518, 611-614, 617-620, 625-628, 671-674, 690-693, 734-737, 869-872, 879-882
Leucine Zipper Pattern	588-609, 870-891
N-myristoylation site	32-37, 75-80, 86-91, 183-188, 721-726, 728-733, 745-750, 810-815, 828-833
Protein kinase C phosphorylation site	12-14, 35-37, 49-51, 146-148, 228-230, 243-245, 250-252, 278-280, 347-349, 371-373, 405-407, 417-419, 490-492, 500-502, 581-583, 592-594, 596-598, 682-684, 690-692, 825-827, 869-871, 895-897
Tyrosine kinase phosphorylation site	134-141, 514-522
Cullen Family Profile	443-552
K-Box Domain Profile	526-619
Nebulin Repeat Profile	521-531
Serine Rich Region	710-747
Rad50 zinc-hook domain profile	460-562
ATP synthase B/B' CF(0)	405-433
Alpha-2-macroglobulin RAP, C-terminal domain	524-705
	334-577

Post Translational Modification

CCDC186 is predicted to undergo multiple posttranslational modifications including predicted O-beta-GlcNAc attachment, phosphorylation, a nuclear export signal, glycation of lysines, GlcNAc O-glycosylation, N-glycosylation, and NetCorona sites.^[10]

Clinical significance

Prior research indicates that the open reading frame of C10orf118 is linked to cutaneous T-cell lymphoma by a tumor antigen L14-2.^[11] The protein CCDC186 is also found at higher than normal levels in the breast cancer cell line BC 8701.^[12]

Notes and References

Oduru S, Campbell JL, Karri S, Hendry WJ, Khan SA, Williams SC . Gene discovery in the hamster: a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs . BMC Genomics . 4 . 1 . 22 . June 2003 . 12783626 . 161800 . 10.1186/1471-2164-4-22 . free .
Web site: Entrez Gene: C10orf118 chromosome 10 open reading frame 118.
Web site: PSORT II Prediction. psort.hgc.jp. 2016-04-24.
Web site: Home - UniGene - NCBI. www.ncbi.nlm.nih.gov. 2016-04-24.
Web site: SDSC Biology Workbench. seqtool.sdsc.edu. 2016-04-24. https://web.archive.org/web/20030811031200/http://seqtool.sdsc.edu/. 2003-08-11. dead.
Web site:
- in Literature citations
. www.uniprot.org. 2016-04-24.
Web site: CCDC186 - Coiled-coil domain-containing protein 186 - Homo sapiens (Human) - CCDC186 gene & protein.
Web site: BLAST: Basic Local Alignment Search Tool. blast.ncbi.nlm.nih.gov. 2016-04-24.
Web site: Motif Scan. myhits.isb-sib.ch. 2016-05-08.
Web site: Welcome to CBS. www.cbs.dtu.dk. 2016-04-24.
de la Peña M, García-Robles I . Intronic hammerhead ribozymes are ultraconserved in the human genome . EMBO Reports . 11 . 9 . 711–6 . September 2010 . 20651741 . 2933863 . 10.1038/embor.2010.100 .
D’Angelo. M. L.. 2013. Identification of the "Uncharacterized Protein C10orf118" in Breast Cancer Cells and Its Role on the Hyaluronan Metabolism. Matrix Pathobiology, Signaling and Molecular Targets..