MGC50722

From Infogalactic: the planetary knowledge core
Jump to: navigation, search

MGC50722, also known as uncharacterized protein LOC399693, is a protein that in humans is encoded by the MGC50722 gene (Mammalian Gene Collection Project Gene 50722[1]). This 965 amino acid human protein has a molecular weight of 104.495 kDa and one domain of unknown function (DUF390).[2] Generally conserved across mammals, this quickly evolving gene shows relatively low expression in most human tissues except in the testis.[3][4]

Gene

The entire human gene is 40,364 base pairs in length, while the unprocessed mRNA is 25,960 base pairs long. After splicing of introns the 10 exon gene has a final mRNA length of 3,596 base pairs that encodes for 965 amino acids.[2][5][6]

Locus

Human MGC50722 is located on the minus strand of chromosome 9 in the region q34 of the human genome (NCBI Gene ID: 399693). The most characterized gene in this region of the human genome is GPSM1, which encodes the G-protein-signaling modulator 1 protein.[7]

Homology and evolution

Divergence of the human MGC50722 gene graphed against Fibrinogen and Cytochrome C divergence. Each data point on the graph represents a different species and that species homologous gene as identified through BLAST. BLAST searches were conducted using the human MGC50722, Fibrinogen and Cytochrome C gene and the percent identities were graphed against the actual divergence from humans for that species homologous gene.

Paralogs

It was found that the centrosome-associated protein 350 (CEP350) was the only possible paralog to protein MGC50722 in humans. CEP350 is a 3117 amino acid long protein and aligns with protein MGC50722 at its N-terminus. This indicates the paralog spacing is very distant for when MGC50722 split from CEP350.

Orthologs

Compete orthologs for protein MGC50722 are found only in mammals, where most conservation is found with in the N-terminus and DUF390.

Distant homologs

The most distant homolog detectable is in cartilaginous fish (462.5 MYA).

Homologous domains

The domain of unknown function 390 (pfam04094: DUF390) is part of a family of proteins that have only been identified within the rice genome. Although this domain’s function is unknown, it may be some kind of transposable element.[8]

Protein

Primary sequence and isoforms

Human protein MGC50722 is 104.495 kDa, with an isoelectric point of 10.24. A mixed charged cluster of amino acids is present between positions 146 and 182, which seems to be conserved in primates, but not present in other mammals. There are also 6 predicted isoforms found in human.[2]

Subcellualar localization signals

PSORTII servers predict 5 nuclear localization signals in the human protein MGC50722. When ortholog sequences to the human protein were run through PSORT II, the predicted nuclear subcellular localization was a consensus prediction.

Predicted nuclear localization signals in human protein MGC50722
Signal Type Residue Span Amino Acid Sequence
pat4 46-49 RPRK
pat4 148-151 KPKR
pat7 43-49 PQQRPRK
pat7 149-155 PKRVKSS
pat7 302-308 PSKRRLQ

Post-translational modifications

Human protein MGC50722 ortholog in mice, 4932418E24Rik protein, has experimentally determined phosphorylation sites at S588, S591, and S670 in the testis (pTestis ID: PT-MM-02686).[9][10][11] Prediction servers at ExPASy also predict more phosphorylation sites (NetPhos 2.0 Server), a N-termnial acetylation site (NetAcet 1.0 Server), glycation sites (NetGlycate 1.0 Server), and a GalNAc O-glycosylation site (NetOGlyc 4.0 Server) at conserved residues in the human MGC50722 protein.

Secondary structure

Prediction models characterized protein MGC50722 as mostly disordered, but two regions of coiled-coils.

Protein internal structure and features

Feature Residue Span
Region of Low Complexity[6] 11-26
DUF390[8][12] 405-690
Region of Low Complexity[6] 410-423
Region of Low Complexity[6] 546-556
Coiled-Coil[6] 546-566
Region of Low Complexity[6] 606-621
Coiled-Coil[6] 720-753
Region of Low Complexity[6] 771-791
Region of Low Complexity[6] 871-884
File:MGC50722 AASeq Features.png
Protein MGC50722 primary amino acid sequence and its internal features/structures.

Potential function

The function of protein MGC50722 is unknown. Given that it is preferentially expressed in the testis and appears to be subcellularly localized in the nucleus, it could play an important role in gamete cells.

Interacting proteins

Due the recent identification of this gene and its protein, interaction databases (MINT, STRING, IntAct, and BioGRID) have not identified any interactions. More data would expand the characterization of MGC50722.

Expression

Expression levels of human MGC50722 appear to low/absent in most cell types, with the highest and most abundant expression shown to be in the testis (GEO Profile IDs: 48997768 and 49895282).[13] A lung cancer study also showed that MGC50722 was expressed in CD4+ T-Cells of normal human tissue samples.[14]

Promoter

The transcriptional start site for MGC50722 aligns best with SPZ1, SORY, SP1F, and FAST[15] transcription factor binding sites.

Clinical significance

A significant GEO Profile relating to MGC50722 was a study done on male fertility in humans looking at the disease teratozoospermia (GEO Profile ID: 38113951).[13] Teratozoospermia is a condition where during the development of mature sperm cells morphology is altered, thus leading to, in some cases, male infertility.[16] Gene expression shows that in normal human subjects MGC50722 is expressed, while in subjects with teratozoospermia expression levels drop significantly or shut off.

References

  1. Lua error in package.lua at line 80: module 'strict' not found.
  2. 2.0 2.1 2.2 Lua error in package.lua at line 80: module 'strict' not found.
  3. Lua error in package.lua at line 80: module 'strict' not found.
  4. Lua error in package.lua at line 80: module 'strict' not found.
  5. Lua error in package.lua at line 80: module 'strict' not found.
  6. 6.0 6.1 6.2 6.3 6.4 6.5 6.6 6.7 6.8 Lua error in package.lua at line 80: module 'strict' not found.
  7. Lua error in package.lua at line 80: module 'strict' not found.
  8. 8.0 8.1 Lua error in package.lua at line 80: module 'strict' not found.
  9. Lua error in package.lua at line 80: module 'strict' not found.
  10. Lua error in package.lua at line 80: module 'strict' not found.
  11. Lua error in package.lua at line 80: module 'strict' not found.
  12. Lua error in package.lua at line 80: module 'strict' not found.
  13. 13.0 13.1 Lua error in package.lua at line 80: module 'strict' not found.
  14. Lua error in package.lua at line 80: module 'strict' not found.
  15. Lua error in package.lua at line 80: module 'strict' not found.
  16. Lua error in package.lua at line 80: module 'strict' not found.

Suggested reading

  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.