LRRC40 explained

Leucine rich repeat containing 40 (LRRC40) is a protein that in humans is encoded by the LRRC40 gene.[1]

Species distribution

LRRC40 is conserved throughout all of its orthologs. The entire protein is highly conserved in mammals, while conservation is high within the leucine rich repeats in the rest of the orthologs.[2] Orthologs were found all the way back to the scarlet sea anemone and homologs were found in bacteria and Archaea using BLAST.[3] The following table gives information on the homologs of LRRC40.

Organism common name Divergence from humans (MYA) [4] NCBI mRNA accession Sequence similarity Protein length Common gene name
Homo sapiens[5] Humans -- NM_017768 100% 602 LRRC40
Pan troglodytes[6] Common chimp 6.4 XM_513483 99% 602 Hypothetical protein
Pongo abelii [7] Orangutan 15.8 NM_001131180 99% 602 LRRC40
Macaca fascicularis [8] Long-tailed macaque 30.2 AB179219 99% 602 Full LRRC40
Callithrix jacchus [9] Common marmoset 43.9 XM_002750952.1 99% 602 Predicted: LRRC40
Sus scrofa [10] Wild boar 92.5 XM_003127928 96% 602 Predicted: LRRC40 like protein
Mus musculus [11] Mouse 94.1 NM_024194 92% 602 LRRC40
Monodelphis domestica [12] Opossum 160.2 XM_001379417 86% 598 Hypothetical protein
Gallus gallus [13] Chicken 274.8 NM_001031295 85% 603 LRRC40
Taeniopygia guttata [14] Zebra finch 274.8 XM_002188367 85% 605 Predicted: LRRC40
Xenopus (Silurana) tropicalis [15] Western clawed frog 389.7 NM_001011310 80% 605 LRRC40
Danio rerio [16] Zebrafish 444.3 NM_199862 83% 601 LRRC40
Salmo salar [17] Salmon 444.3 BT043621 82% 600 LRRC40
Nematostella vectensis [18] Scarlet sea anemone 830.3 XM_001640230 66% 602 Predicted protein
Culex quinquefasciatus [19] Southern house mosquito 838.3 XM_001842697.1 58% 612 LRRC40

Gene

LRRC40 is located on the negative DNA strand (see Sense (molecular biology)) of chromosome 1 from 70,611,483- 70,671,223.[20] The gene produces a 2958 base pair mRNA. There are 15 predicted exons in the human gene [5] with four other splice patterns predicted on GeneCards by the Alternative Splice Database.[21]

Gene neighborhood

LRRC40 is neighbored downstream by LRRC7 (70,225,888 - 70,587,570) on the positive DNA strand and upstream by SRSF11 (70,687,320-70,716,488) on the positive DNA strand.

Gene expression

LRRC40 is expressed between the 50th and 100th percentile in almost every tissue in the body.[22]

Protein

While the exact function of the LRRC40 protein is not yet understood, it is believed to participate in protein-protein interactions because it is a member of the leucine rich repeat family of proteins which are known to participate in protein-protein interactions.[23]

Properties

LRRC40 is a 602 amino acid protein with a molecular weight of 68.254 kDa and an isoelectric point of 6.04.[24] LRRC40 is expected to localize to the nucleus[25] and has no transmembrane domains to anchor it to the nuclear membrane. LRRC40 has many predicted phosphorylation sites. Of the 19 predicted phosphoserine sites, only two are conserved within the orthologs.[26] These two sites are S38 and S391.

Protein structure

The secondary structure of the protein has a pattern within the leucine repeat regions. Each leucine repeat has a β-sheet and α-helix. The image to the right shows the particular horseshoe-like structure of a protein with many leucine rich repeats. Depending on the area where the LRRs are located, other proteins can bind within the curve of the horseshoe or attach to the outside of the protein.

Protein interactions

According to Genecards, LRRC40 has 756 possible protein interactions.[21] These interactions are based on results in the Molecular Interaction database which provided two possible protein interactions. The two proteins are described in the table below.

Abbreviation Protein name NCBI protein accession Cellular location Function
Cell division cycle 5-like protein NP_001244 nucleus transcription regulation and mRNA processing [27]
Ski-interacting protein NP_036377.1 nucleus mRNA processing [28]

Notes and References

  1. Web site: Entrez Gene: leucine rich repeat containing 40.
  2. Chenna R, Sugawara H, Koike T, Lopez R, Gibson TJ, Higgins DG, Thompson JD . Multiple sequence alignment with the Clustal series of programs . Nucleic Acids Res. . 31 . 13 . 3497–500 . July 2003 . 12824352 . 168907 . 10.1093/nar/gkg500.
  3. Web site: NCBI BLAST.
  4. Web site: Time Tree.
  5. Web site: NCBI Nucleotide: NM_017768.4. 24 June 2018.
  6. Web site: NCBI Nucleotide: XP_513483. 20 March 2018.
  7. Web site: NCBI Nucleotide: NM_001131180. 19 February 2022.
  8. Web site: NCBI Nucleotide: AB179219. 6 October 2006.
  9. Web site: NCBI Nucleotide: XM_002750952.1. 18 May 2010.
  10. Web site: NCBI Nucleotide: XM_003127928. 13 May 2017.
  11. Web site: NCBI Nucleotide: NM_024194. 13 August 2022.
  12. Web site: NCBI Nucleotide: XM_001379417. 27 April 2016.
  13. Web site: NCBI Nucleotide: NM_001031295. 9 March 2022.
  14. Web site: NCBI Nucleotide: XM_002188367. 12 February 2013.
  15. Web site: NCBI Nucleotide: NM_001011310 . 19 June 2021.
  16. Web site: NCBI Nucleotide: NM_199862 . 20 November 2021.
  17. Web site: NCBI Nucleotide: BT043621 . 24 November 2009.
  18. Web site: NCBI Nucleotide: XM_001640230 . 31 January 2009.
  19. Web site: NCBI Nucleotide: XM_001842697.1 . December 2009.
  20. Web site: NCBI Gene: 55631.
  21. Web site: GeneCards: LRRC40.
  22. Web site: GEO Profiles: LRRC40 GDS596.
  23. Kobe B, Kajava AV . The leucine-rich repeat as a protein recognition motif . Curr. Opin. Struct. Biol. . 11 . 6 . 725–32 . December 2001 . 11751054 . 10.1016/S0959-440X(01)00266-4.
  24. Web site: ExPASy: Compute PI/Mw. https://web.archive.org/web/20030723023847/http://www.expasy.org/cgi-bin/pi_tool. 2003-07-23. dead.
  25. Web site: PSORTII: Protein Localization Tool.
  26. Web site: NetPhos 2.0 Server: Phosphorylation Prediction.
  27. Web site: MINT: CDC5L. https://archive.today/20130218153409/http://mint.bio.uniroma2.it/mint/search/interactor.do?interactorAc=MINT-133723&dataSet=&. 2013-02-18. dead.
  28. Web site: MINT: SNW1. https://archive.today/20130218185926/http://mint.bio.uniroma2.it/mint/search/interactor.do?interactorAc=MINT-193944&dataSet=&. 2013-02-18. dead.