Chitinase domain-containing protein 1 (CHID1) is a highly conserved protein of unknown function located on the short (p) arm of chromosome 11 near the telomere.[1] The protein has 27 introns, which allows for many isoforms of this gene. It has several aliases, the most common of which is Stabilin-1 interacting chitinase-like protein (SI-CLP). As indicated by the alias, CHID1 is known to interact with the protein STAB1.[2] CHID1 is expressed ubiquitously at levels nearly 6 times the average gene,[3] and is conserved very far back to organisms such as Caenorhabditis elegans and possibly some prokaryotes. This protein is known to have carbohydrate binding sites, which could be involved in carbohydrate catabolysis.
CHID1 is located on chromosome 11 at the location p15.5.[4] It is just downstream of TSPAN4 and upstream of AP2A2.[5] CHID1 is ubiquitously expressed at a high levels. Through microarray analysis, it has been shown that CHID1 is generally expressed at 5.7 times the average gene. CHID1 has many known variants, which is attributed to its 37 exons. There are no inherent repeats or hairpin structures to be found within the coding region of CHID1. This gene is a member of the GH18 superfamily, which dictates some of its protein structure. This gene also has several aliases, the most common of which is Stabilin-1 interacting chitinase-like protein, or SI-CLP, which indicates its known interaction with STAB1.[6]
Due to its large size and many exons, CHID1 has many transcript variants that have been identified through mRNA sequencing. CHID1 has 27 exons and 22 known splice forms.[3] These forms indicate that there may be multiple promotor regions and transcription start sites used within the genome. The most commonly found transcripts translate to about 400 amino acids each.
CHID1 shows a very high level of conservation. It has been identified in many model systems including Drosophila melanogaster, Caenorhabditis elegans, and Oryza sativa.[7] When conservation is observed over a large region, it becomes clear which protein domains are most important to CHID1. By far, the two best conserved factors are exon junctions you can add a link to exon junction, as well as several known carbohydrate binding sites. Another important region appears to be the last 15 amino acids, which retain a high level of conservation even through insect sequences.
Species | Common name | Divergence in millions of years | Accession number | Similarity to human sequence | |
---|---|---|---|---|---|
Gorilla gorilla gorilla | Gorilla | 8.8 | XP_004050445.1 | 100% | |
Macaca mulatta | Rhesus macaque | 29.0 | AFI38071.1 | 94% | |
Equus caballus | Horse | 94.2 | XP_003362692.1 | 91% | |
Felis catus | House cat | 94.2 | XP_003993852.1 | 90% | |
Taeniopygia guttata | Finch | 296 | XP_002199280.2 | 78% | |
Ictalurus punctatus | Catfish | 400.1 | NP_001187344.1 | 70% | |
Apis florea | Honey bee | 782.7 | XP_003691984.1 | 61% |
There are also several proposed paralogs to CHID1. These genes are di-N-acetylchitobiase, Chitinase-3-like protein 2, and chitotriosidase 1. All of these genes have roughly 40% similarity, and match human transcripts of CHID1 over 40-50% of the gene.[8] Based on the level of conservation, these paralogs all likely split at a time near to the split from bacteria.
Name | Accession number | Length (amino acids) | Similarity to CHID1 in humans | |
---|---|---|---|---|
di-N-acetylchitobiase | NP_004379.1 | 385 | 38% | |
Chitinase-3-like protein 2 | NP_001020368.1 | 380 | 39% | |
Chitotriosidase-1 | NP_001243054.2 | 447 | 40% |
The protein translation of CHID1 is typically about 400 amino acids long (though this varies within the many known forms), and has few post-translational modifications.[9] In most forms and orthologs, it is predicted that CHID1 has a signal peptide that varies in length by sequence.[10] Structure predictions and x-ray crystallography structures of CHID1 indicate that its secondary structure is heavy in alpha helices, though it definitely has some beta strands present, as indicated by the SI-CLP x-ray crystallography structure. CHID1 is also a member of the GH18 superfamily of proteins, which has a unique conserved structure.[11] Members of this family contain an 8 stranded beta/alpha barrel. This protein is also shown to interact with other copies of itself in a homodimer, which also uses sulfate ion interacting molecules for an unknown purpose.
The expression of CHID1 is consistently shown to be ubiquitous and higher than average in all human tissues.[12] [13] [14] When analyzing the promoter region of CHID1 in humans, this expression is explained by the large number of transcription factor binding sites which are active in a wide range of tissues or are even ubiquitous.[15] In Drosophila melanogaster, tissue expression varies wildly between larval and adult tissue.[16] While expression is extremely widespread in adults, it is not at such a high level as in humans. In larvae, CHID1 is only shown to be expressed in specific tissues, and is up-regulated in very different tissues than in adults. In larval tissues, expression is highest in the salivary gland, but in adults the highest expression by far is in the male accessory gland.[16] Though expression is generally high in some species, it is not necessarily abnormal. In situ hybridization of gene transcripts in mouse brains shows that CHID1 is expressed relatively normally.[17] The pattern of high and low expression with the brain is very similar to another widespread gene: beta actin.[18] This is one indication that CHID1 may show normal expression patterning in mammals, despite its general upregulation.
Although CHID1 has no known function for certain, there are several proposed activities of the protein based on current knowledge. This protein may participate in metabolic processes such as chitin catabolism or carbohydrate metabolism. It may locate in various cellular compartments such as the cytoplasm, or in lysosomes all depending on certain post-translational modifications.[19] Another study proposed that CHID1 may have roles in pathogen sensing.[11] The function of CHID1 may also depend on its putative binding partner, STAB1, which is proposed to participate in cell signaling and defense against bacterium.[20]
CHID1 is only strongly suggested to interact with one other protein. The transmembrane protein stabilin-1 has been detected as an interactant by in vitro, in vivo, and yeast two-hybrid assays.[6] STAB1 is a large transmembrane receptor protein which may function in many aspects such as lymphocyte homing or angiogenesis. It is expressed at over twice the average gene level and is expected to play a role in cell defense against bacterium.[21] This interaction may give insight towards the function of both proteins as a whole.
Overall, the significance of CHID1 is unknown. Many expression profiles of CHID1 show that its expression does not change in any common diseases or conditions as currently studied.[22] Given the lack of knowledge about the function of CHID1, it is difficult to further study its role within the human condition.