CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).
The CXorf49 gene has one alias CXorf49B.[1] The recname A8MYA2 also refers to the protein coded by CXorf49 or CXorf49B.[2]
CXorf49 is located on the X chromosome at Xq13.1. It is 3912 base pairs long and the gene sequence has 6 exons.[3] CXorf49 has one protein coding transcript.[4]
The protein has 514 amino acids and a molecular mass of 54.4 kDa.[5] The isoelectric point is 9.3. Compared to other human proteins CXorf49 is glycine- and proline-rich, but the protein has lower levels of asparagine, isoleucine, tyrosine and threonine(Statistical Analysis of Protein Sequences, SAPS [6]).
The domain of unknown function, DUF4641, is almost the entire protein. It is 433 amino acids long, from amino acid 80 until amino acid number 512.[7] DUF4641 is a part of pfam15483.[8] The domain is proline- and arginine-rich, but DUF4641 has lower levels of isoleucine, tyrosine and threonine compared to other proteins in human (Analysis of Protein Sequences, SAPS). DUF4641 has an unusual spacing between lysine residues and positive charged amino acids (Analysis of Protein Sequences, SAPS).
CXorf49 is predicted to have several post-translational sites. This include sites for N-acetyltransferase (NetAcet 1-[9]), glycation of ε amino groups of lysines (NetGlycate 1.0[10]), mucin type GalNAc O-glycosylation (NetOglyc 4.0[11]), phosphorylation (NetPhos 2.0[12]), sumoylation (SUMOplot Analysis Program[13]) and O-ß-GlcNAc attachment(YinOYang WWW[14]).
The CXorf49 protein has been predicted to be located in the cell nucleus (PSORT II [15]).
The promoter region of CXorf49 is located between base pair 71718051 and 71718785 on the minus strand of the X chromosome and it is 735 bp long (Genomatix’s ElDorado program[16]). One of the most frequent transcription factor binding-sites in the promoter region are sites for Y-box binding factor.
Though expression of CXorf49 is very low in human cells, is it somewhat higher in connective tissues, testis and uterus(NCBI-Unigene [17]).
The protein CXorf49 has not yet been shown to interact with other proteins (PSICQUIC[18]).
CXorf49 is found to be one of the components of a small group of the HL-60 cell proteome that were most prone to form 4-Hydroxy-2-nonenal(HNE) adducts, upon exposure to nontoxic (10 μM) HNE concentrations, along with heat shock 60 kDa protein 1.[19]
Using BLAST[20] no orthologs for CXorf49 are found in single celled organisms, fungi or plants whose genomes have been sequenced. For multicellular organisms orthologs are found in mammals. The table below show a selection of the mammal orthologs. They are listed after time of divergence from human.
Genus and species name | Common name | Accession Number | Sequence length | Identity to human protein | |
---|---|---|---|---|---|
Pan troglodytes | Chimpanzee | XP_001137982 | 514 aa | 98 % | |
Callithrix jacchus | Common marmoset | XP_008987719 | 487 aa | 65 % | |
Galeopterus variegatus | Malayan flying lemur | XP_008574823 | 525 aa | 54 % | |
Tupaia chinensis | Chinese tree shrew | XP_006168003 | 527 aa | 35 % | |
Chinchilla lanigera | Long-tailed chinchilla | XP_013358263 | 307 aa | 49 % | |
Mus musculus | House mouse | NP_081944 | 513 aa | 36 % | |
Canis lupus familiaris | Dog | XP_850392 | 526 aa | 54 % | |
Odobenus rosmarus divergens | Pacific walrus | XP_012422579 | 530 aa | 51 % | |
Mustela putorius furo | Ferret | XP_004777306 | 544 aa | 50 % | |
Lipotes vexillifer | Chinese river dolphin | XP_007452050 | 529 aa | 45 % | |
Ovis areis | Sheep | XP_004022229 | 536 aa | 45 % | |
Capra hircus | Goat | XP_005700711 | 538 aa | 44 % | |
Myotis lucifugus | Little brown bat | XP_006083036 | 500 aa | 42 % | |
Myotis davidii | David's myotis | XP_006759573 | 495 aa | 42 % | |
Bos taurus | Cattle | NP_001092664 | 534 aa | 42 % | |
Equus asinus | Asinus | XP_014707878 | 723 aa | 42 % | |
Trichechus manatus latirostris | Florida manatee | XP_012415455 | 505 aa | 44 % | |
Dasypus novemcinctus | Nine-banded armadillo | XP_004475873 | 497 aa | 44 % | |
Orycteropus afer afer | Aardvark | XP_007957133 | 477 aa | 38 % |
CXorf49 has developed from aardvarks, to the human protein over 105.0 million years.