CFAP97D2 explained

CFAP97D2

CFAP97D2 (Cilia and Flagella Associated Protein 97 Domain containing 2) is a protein that in humans is encoded by the CFAP97D2 gene.

Gene

Homo sapiens CFAP97D2 gene (XM_017020910.2) is a 68,395 base pair gene that encodes mRNA transcripts ranging from 952 nucleotides to 1841 nucleotides.[1] It is located on Chromosome 13 and found at locus 13q34 on the plus strand. CFAP97D2 is on Chromosome 13. This gene encodes 166 amino acids that make up Cilia and Flagella Associated Protein 97 containing Domain 2 (CFAP97D2) protein. The CFAP97D2 gene is also known as C17orf105 Homolog gene.[2]

mRNA

Isoforms

There are 5 mRNA transcripts produced from the CFAP97D2 gene. These transcripts encode 5 different CFAP97D2 isoforms: X1, X2, X3, 1, and 2. Isoform X1 is 166 AA, the longest isoform of Homo sapiens CFAP97D2. Isoform X2 has 2 deletirious mRNA mutations at the end of exon 5 accounting for a single amino acid deletion. The final protein transcript is a 165 amino acids. Isoform X3 has numerous mRNA insertions in Exons 3, 4, and 5. The final protein length is 101 AA. Isoforms 1 and  2 have complete deleterious mRNA mutations of Exon 4 and encode final protein lengths of 99 and 98 AA respectively.[3]

Expression

RNA sequencing revealed CFAP97D2 expression in the brain, lungs, pancreas, testis, fallopian tubes, and cervix.[4] [5] [6]

Protein

Primary Structure

Homo sapiens CFAP97D2 Isoform X1 is a basic protein with a predicted PI of 10.4 and Mw 19.3 kaD.[7] There is nuclear leucine zipper motif (AA #37-52) and nuclear localization signal (AA #102-116).[8] [9]

Secondary Structure

By definition, the leucine zipper region of Homo sapiens CFAP97D2 (AA #37-52) is an alpha helix.[10]

Tertiary Structure

CFAP97D2 is characterized by coiled-coiled regions and 2 alpha helices[11]

Homology

CFAP97 Gene Family

The CFAP97 gene family contains 3 genes: CFAP97, CFAP97D1, and CFAP97D2 and is characterized by the KIAA1430 gene domain. CFAP97D1 has longer evolutionary conservation than CFAP97D2 with an estimated date of divergence 431 MYA.

Strict Orthologs

CFAP97D2 is a highly conserved protein found in primates, rodents, bats, even-toed ungulates, otarlidae, birds, reptiles, and bony fish.[12] Primate and rodent CFAP97D2 proteins are most recently related (% identity range: 74-100%). Bats, even-toed ungulates, otariidae, and birds are moderately related to Homo sapiens CFAP97D2 (% identity range: 55.9-73%) and reptile and body fish species are most distantly related (% identity range: 25.6-42%). The asiactic toad is an outlier as a reptile with 60.4% identity with Homo sapiens CFAP97D2.

Paralogs and Distant Homologs

CFAP97D1 is conserved in both invertebrate and vertebrate species.[13] The CFAP97D1 genes found in vertebrate species are paralogs of CFAP97D2. The invertebrate CFAP97D1 genes existing prior to 431 MYA are distant CFAP97D2 homologs due to these paralogs' shared evolutionary history.

Evolution

CFAP97D1 has longer evolutionary conservation than CFAP97D2.[14] CFAP97D1's slow and consistent conservation is evidenced by its little divergence from its ortholog ancestors over 600 million years while CFAP97D2 has changed rapidly over 431 million years. Present only in vertebrate species, CFAP97D2's rapid evolution indicates that it is under different selective pressures than CFAP97D1 and therefore serves in a different functional capacity.

Notes and References

  1. Web site: CFAP97D2 CFAP97 domain containing 2 [Homo sapiens (human)] - Gene - NCBI ]. 2022-12-16 . www.ncbi.nlm.nih.gov.
  2. Web site: CFAP97D2 Gene - GeneCards C97D2 Protein C97D2 Antibody . 2022-12-16 . www.genecards.org.
  3. Web site: Human BLAT Search . 2022-12-16 . genome.ucsc.edu.
  4. Web site: CFAP97D2 CFAP97 domain containing 2 [Homo sapiens (human)] - Gene - NCBI ]. 2022-12-14 . www.ncbi.nlm.nih.gov.
  5. Web site: CFAP97D2 protein expression summary - The Human Protein Atlas . 2022-12-14 . www.proteinatlas.org.
  6. Web site: Gene Expression in 54 tissues from GTEx RNA-seq of 17382 samples, 948 donors (V8, Aug 2019) (RP11-569D9.5) . 2022-12-14 . genome.ucsc.edu.
  7. Web site: Expasy - Compute pI/Mw tool . 2022-12-14 . web.expasy.org.
  8. Web site: ELM - Search the ELM resource . 2022-12-16 . elm.eu.org . en.
  9. Web site: Motif Scan . 2022-12-16 . myhits.sib.swiss . en.
  10. Seldeen . Kenneth L. . McDonald . Caleb B. . Deegan . Brian J. . Bhat . Vikas . Farooq . Amjad . 2010-04-16 . Dissecting the Role of Leucine Zippers in the Binding of bZIP Domains of Jun Transcription Factor to DNA . Biochemical and Biophysical Research Communications . 394 . 4 . 1030–1035 . 10.1016/j.bbrc.2010.03.116 . 0006-291X . 2860604 . 20331972.
  11. Zheng . Wei . Zhang . Chengxin. Yang . Li . Pearce . Robin . Bell . Eric W. . Zhang . Yang . 2021 . Folding non-homology proteins by coupling deep-learning contact maps with I-TASSER assembly simulations. . Cell Reports Methods . 1 . 100014 . 100014 . 10.1016/j.crmeth.2021.100014 . 34355210 . 8336924 . I-Tasser.
  12. Web site: BLAST: Basic Local Alignment Search Tool . 2022-12-14 . blast.ncbi.nlm.nih.gov . [XP_016876399.1].
  13. Web site: CFAP97D1 CFAP97 domain containing 1 [Homo sapiens (human)] - Gene - NCBI ]. 2022-12-16 . www.ncbi.nlm.nih.gov.
  14. Web site: TimeTree :: The Timescale of Life . 2022-12-16 . timetree.org . en.