C12orf50


Chromosome 12 Open Reading Frame 50 is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.

Function

The ontology points to the function of C12orf50 is to enable mRNA and protein binding. It also is involved in poly+ mRNA export from the nucleus.

Isoforms

The C12orf50 gene has 6 isoforms.
IsoformNCBI AccessionmRNA length Protein length Features
X1NM_152589.31711414Longest protein with a zinc finger
X2NM_001363616.21669375
X3XM_017018887.11940468Longest protein without a zinc finger
X4XM_017018888.21550375
X5XM_011537985.13879395Longest mRNA
X6XM_024448868.11577374

Gene expression

In an analysis of human tissues with specific expression by the genome, RNA-seq was performed on tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity protein-coding genes found that the expression of C12orf50 is very low in most human tissues with the exception of the testis. C12orf50's expression was restricted towards testis.

Protein

Uncharacterized protein Chromosome 12 Open Reading Frame 50 is a protein in humans, encoded by the C12orf50 gene. The protein accession id is Q8NA57. The protein has a length of 414aa. The predicted mass of the protein is 47.2 kDa. The protein includes a CCCH-type Zn Finger Domain. The protein has a CCCH-type Zn Finger Domain with a C-X8-C-X5-C-X3-H motif. The domain starts at the beginning of the protein and goes to the 44th amino acid. The protein also has three disordered regions from the 136th amino acid to 168th with of length of 33 aa, 297th to 333rd with a length of 37 aa, and 346th to 414th with a length of 69 aa. The predicted molecular weight is 47.3 kDa and the predicted isoelectric point is 8.79.

Structure

The predicted tertiary structure for C12orf50 has two beta-sheets towards the beginning of the protein in the zinc finger domain and a helix from 106-124aa. These are conserved throughout mammalian orthologs. There is also a large number of coiled regions. The promoter, 3' UTR region, and 5' UTR are very well conserved. There is a negative cluster before and at the beginning of the helix from amino acid 87 to 111.

Localization

There is a 47.8% probability of being in the nucleus and a 30.4% probability of being in the cytoplasm. This was confirmed by immunohistochemistry and immunofluorescence by Sigma-Aldrich showing positivity in both the nucleus and cytoplasm. There is a nuclear location signal and acidic domain. The orthologs also confirm that C12orf50 is localized in the nucleus and cytoplasm.

Protein Interactions

There are two proteins that interact with C12orf50. Glyceraldehyde-3-phosphate dehydrogenase, spermatogenic enzyme may play an important role in regulating the switch between different energy-producing pathways, and it is required for sperm motility and male fertility.

Post-translation Modifications

C12orf50 has been predicted to undergo various phosphorylation, c-mannosylation, and O-glycosylations. The phosphorylation sites are at amino acids 262, 349 and 370. The O-glycosylation sites are amino acids 139, 238, and 374. The c-mannosylation sites are amino acids 13, 102, 292, and 388.

Evolution

C12orf50 has an evolutionary rate that is close to Fibrinogen alpha, making it relatively quick. Orthologs for C12orf50 have been found in mammals, reptiles, birds, and amphibians caecilians. No orthologs were found for frogs, fish, invertebrates, or fungi. The mammalian orthologs shared the most similarity with humans with the exception of the platypus. The range of divergence from humans from mammals was 6.4-180 million years. The reptilian orthologs were the next similar and diverged around 318 million years ago. Then the birds diverged from humans at the same time as the reptiles. The least similar was the amphibian caecilians and they diverged around 351.7 million years ago.

Homology

Orthologs

C12orf50 has orthologs in mammals, aves, reptiles and caecilian amphibians. No orthologs were found in amphibian frogs, invertebrates, plants, fungi, or yeast. The table below shows some of the orthologs that can be found on BLAST.
SpeciesOrganism common nameNCBI AccessionSequence identity ! !Sequence similarityLength-
Homo sapiensHumanNP_689802.1100%100%414
Pan paniscusBonoboXP_003828373.198.8%99.3%414
Phoca vitulinaHarbor sealXP_032272023.187.7%93.0%415
Lipotes vexilliferDaiji dolphinXP_007449184.186.3%92.3%415
Gavialis gangeticusGharialXP_019370370.147.1%63.5%378
Gopherus gangeticusTortoisesXP_030404453.147.1%61.3%414
Alligator sinensisChinese alligatorXXP_006027976.144.2%60.1%363
Gallus gallusChickenXP_040518234.138.5%52.3%405
Coturnix japonicaJapanese quailXP_032299702.136.8%50.9%403
Geotrypetes seraphiniGaboon caecilianXP_033807710.139.7%55.3%409

Paralogs

C12orf50 has two paralogs: ZC3H11A and ZC3H11B. The zinc the finger domain is considered in both of the paralogs.
GeneNCBI Accession ! !Sequence similarityLength-
C12orf50NP_689802.1100%414
ZC3H11ANP_001306167.116.2%810
ZC3H11BNNP_001342386.1 15.9%805