CUX1 is an animal gene. The name stands for Cut like homeobox 1. The term "cut" derives from the "cut wing" phenotype observed in a mutant of Drosophila melanogaster. In mammals, a CCAAT-displacement activity was originally described in DNA binding assays. The human gene was identified following purification of the CCAAT-displacement protein (CDP) and has been successively been called CDP, Cut-like 1 (CUTL1), CDP/Cut and finally, CUX1. (Consequently, a literature search for CUX1 fails to bring up many of the earlier studies). Cut homeobox genes are present in all metazoans. In mammals, CUX1 is expressed ubiquitously in all tissues. A second gene, called CUX2, is expressed primarily in neuronal cells.
Multiple reviews have detailed the tissue-specific functions and genetic interactions in Drosophila and the early characterization of the CDP in mammalian cells, expression and activity of CUX1 in the myeloid cell lineage, multiple CUX1 isoforms, modes of DNA binding, early mouse models and cell-based assays, roles of CUX1 in kidney development and homeostasis, mechanisms by which CUX1 stimulates cell migration and invasion, roles of CUX1 and CUX2 in neurons of the cortex upper layer and the paradoxical implications of CUX1 in cancer. In addition, a number of commentaries present more specific views and speculations.
The human CUX1 gene is large, encompassing more than 440,000 base pairs with two alternative first exons and an additional 23 exons. The last exon has a weak polyadenylation site allowing RNA polymerase II often to continue transcribing until it reaches an additional 10 exons. Splicing of this longer transcript from exon 14 to exon 25 generates a mature mRNA that codes for a protein that was called CASP (Cut alternatively spliced product). CASP localizes to the Golgi and does not seem to impact at all on CUX1 function. However, because of the complex structure of the gene, most oligos in microarrays were derived from the most 3' exons that are unique to CASP. Thus, until the advent of RNA sequencing CUX1 expression data has been essentially limited to immunohistochemical analyses. Similarly, many guide RNAs in CRISPR-Cas screening studies target the CASP-specific exons and do not affect CUX1.
The use of two transcription start sites, alternative splicing of some exons and proteolytic processing combine to generate several CUX1 protein isoforms (reviewed) The full-length protein, often referred to as p200 CUX1, contains five evolutionarily conserved domains: a coiled-coil (CC), three Cut domains (C1, C2 and C3), originally called Cut repeats (CRs), and a Cut homeodomain (HD). The coiled-coil's function remains to be defined. The three Cut domains and the Cut homeodomain were originally characterized as DNA binding domains, and were later found to be involved also in protein-protein interactions. In addition to these conserved domains, the n-terminal 100 amino acids contains an autoinhibitory domain, while the carboxy-terminal region downstream of the homeodomain was shown to function as an active repression domain. In mid to late G1, the full-length protein is proteolytically processed to generate a shorter isoform, p110 CUX1 that lacks approximately the n-terminal 747 amino acids and thus contains only 3 DNA binding domains: CUT domains 2 and 3 and the Cut homeodomain (C2C3HD).
DNA binding assays with histidine-tagged fusion proteins showed that one Cut domain is not sufficient for DNA binding, while several combination of domains were found to bind to DNA with distinct affinities and kinetics: CR1CR2, CR3HD and CR2CR3HD. The C1C2 protein displayed very rapid "on" and "off" DNA binding rates, whereas any combination of a Cut domain with the Cut homeodomain exhibited slower binding kinetics. Interestingly, the full-length CUX1 protein purified from insect cells exhibited DNA binding kinetics similar to that of CR1CR2, suggesting that the Cut homeodomain may not be active in the context of the full-length protein.
p200 CUX1 is a very abundant protein that binds to DNA with extremely fast kinetics (rapid "on" and "off" rates). This is not consistent with a role as a classical transcription factor that binds stably to DNA and recruits a co-activator or a co-repressor, however, the p200 CUX1 protein may still be able to repress transcription by competition for binding site occupancy. Indeed, CUX1 was originally purified as the CCAAT-displacement protein (CDP). In addition to this potential role in transcriptional repression, recent studies revealed that p200 CUX1 functions as an auxiliary factor in base excision repair. In vitro, CUT domains of CUX1 stimulate the enzymatic activities of the 8-oxoguanine DNA glycosylase (OGG1) and the apurinic/apyrimidinic endonuclease 1 (APE1). In agreement with results from in vitro DNA repair assays, a p200 CUX1-GFP fusion protein is rapidly recruited to DNA damage generated by laser microirradiation, while CUX1 knockdown in human cells causes a delay in the repair of oxidative DNA damage and mono-alkylated bases in genomic DNA, and sensitizes cancer cells to treatments with H2O2, ionizing radiation and the mono-alkylating agent temozolomide. Conversely, p200 CUX1 overexpression increases the resistance of cancer cells to the same treatments. Structure/function analysis established that a fusion protein containing Cut domains 1 and 2 linked to a nuclear localization signal (C1C2-NLS) is rapidly recruited to DNA damage and is sufficient to accelerate the repair oxidative DNA damage and mono-alkylated bases in genomic DNA. Since the C1C2-NLS protein is devoid of transcription activation potential, these results suggest that Cut domains are directly involved in DNA repair.
The shorter p110 CUX1 isoform stably interacts with DNA and can function as transcriptional repressor or activator depending on promoter context. Transcription and cell-based assays demonstrated a role for p110 CUX1 in stimulating cell cycle progression and cell proliferation, strengthening of the spindle assembly checkpoint, ensuring an efficient DNA damage response, promoting cell migration and invasion, and increasing resistance to apoptotic signals. Of note, the p200 CUX1 protein was inactive in all these transcriptional and cell-based assays.
p110 CUX1 has been demonstrated to bind the protective allele of FTO single nucleotide polymorphisms (SNPs) rs8050136 or rs1421085 highly associated with human obesity, and promote the expression of RPGRIP1L.[1] [2] [3] RPGRIP1L hypomorphism in mice results in obesity,[4] [5] suggesting an important role of CUX1 in body weight regulation.
CUX1 is overexpressed in many cancers. The comprehensive molecular characterization of human colon and rectal cancer performed by The Cancer Genome Atlas (TCGA) ranked CUX1 as the fifth gene on a scale showing a correlation between tumour aggressiveness and gene expression/somatic copy number alterations. TCGA and REMBRANDT data also show shorter survival of glioblastoma patients with high CUX1 mRNA expression (reviewed). In smaller scale studies, immunohistochemical analyses on breast, pancreas and glioblastoma cancers reveal that CUX1 expression inversely correlates with relapse-free and overall survival. An alternative CUX1 transcript that is initiated within intron 20 and codes for a p75 isoform is expressed specifically in the testis and thymus. This transcript was found to be aberrantly expressed in many breast tumour cells lines and breast tumours. Transgenic mice expressing this transcript in mammary epithelial cells were shown to develop mammary tumours with metastasis to the lung.
CUX1 transgenic mice develop tumors in multiple organs and tissues after a long latency period. Transgenic mice expressing either p75, p110 or p200 CUX1 in mammary epithelial cells develop mammary tumours with a low penetrance (~20%) and with a long latency period. Metastasis to the lung was observed in three cases of mammary tumors expressing p75 CUX1.
The mechanisms of action differ widely between p110 CUX1 and p200 CUX1. Transgenic mice expressing p110 CUX1 in mammary epithelial cells develop mammary tumors that exhibit a sub-tetraploid chromosome number, suggesting passage through tetraploidy resulting from cytokinesis failure. Cells do not usually survive as tetraploid with multiple centrosomes. Overexpression of p110 CUX1 was shown to activate a transcriptional program that reinforces the spindle assembly checkpoint and delays mitosis until extranumerary centrosomes have clustered to two poles, thereby enabling bipolar mitosis and survival of tetraploid cells. Yet, passage through a multipolar intermediate enriches for merotelic chromosome attachments, leading to chromosome mis-segregation and the rapid generation of aneuploid populations from which tumorigenic cells emerge.
Tumors that develop in p200 CUX1 transgenic mice reveal a different mode of action, as 44% of these tumors harboured a spontaneous mutation activating the Kras gene. The cooperation between RAS and p200 CUX1 was confirmed by lentiviral infections in the lung. RAS oncogenes do not transform primary cells, but instead cause cellular senescence. This results from the elevated production of reactive oxygen species (ROS) that lead to oxidative DNA damage. Biochemical and cell-based assays demonstrated that the CUT domains within CUX1 and other Cut domain proteins stimulate the enzymatic activities of some enzymes of the base excision repair pathway and accelerate the repair of oxidative DNA damage. Hence, human and primary rodent fibroblasts that are transfected with KRAS and CUX1 exhibit similar ROS levels as cells that receive only KRAS but show much reduced DNA damage. In agreement with these findings, CUX1 knockdown is synthetic lethal in all cancer cells exhibiting high levels of ROS as a consequence of activating mutations in either KRAS, HRAS, BRAF or EGFR.
Loss-of-heterozygosity (LOH) of the 7q22.1 chromosomal region, where CUX1 resides, was reported in 8–22% of various cancer, and in close to 50% in therapy-associated leukemias. As no inactivating mutations were found in the remaining allele, the notion that CUX1 could be the tumor suppressor gene on 7q22.1 was dismissed for some time. However, refined mapping in leiomyomas and myeloid leukemias eventually pointed to CUX1 as the sole tumor suppressor gene in this region. These genetic data suggest that CUX1 may be a haploinsufficient tumor suppressor gene. In addition to LOH events, large scale DNA sequencing identified inactivating point mutations in 1–5% of cancers where both alleles are present. There is no tumor case where both CUX1 allele are inactivated.
While the biochemical activities of CUX1 that explain its role in tumor maintenance and progression have been well defined, the biochemical functions of CUX1 involved in tumor suppression have yet to be firmly established. Two mechanisms have been proposed.One study reported that CUX1 functions as a transcriptional activator of PIK3IP1, a gene that codes for the phosphoinositide-3-kinase interacting protein 1 (PIK3IP1), a direct inhibitor of the PI3K p110 catalytic subunit. CUX1 knockdown caused a decrease in PIK3IP1 expression that was associated with an increase in PI3K signaling and AKT signaling. A separate study, however, argued that CUX1 is itself a target of AKT signaling. Activation of the PI3K–AKT signaling pathway by insulin-like growth factor 1 (IGF1) or by AKT2 overexpression led to the upregulation of CUX1 and was associated with resistance to apoptosis, whereas treatment of cells with the PI3K inhibitor LY294002 decreased CUX1 expression and increased apoptosis. Whether the discrepancies in the results and conclusions from these two studies can be explained by differences in cell-types or other reasons remains to be verified.The auxiliary role of CUX1 in base excision repair has raised the possibility that its tumor suppressor function may be linked to its function in DNA repair. CUX1 knockdown delays DNA repair in multiple cell lines. Mouse embryo fibroblasts (MEFs) derived from a Cux1-/- knockout mouse exhibit increased genomic instability23, moreover Cux1+/−- heterozygous MEFs are haploinsufficient for DNA repair18. Yet, whether CUX1 hemizygosity augments the risk of cancer by increasing the frequency of mutations and/or genomic rearrangements remains to be formally tested.