C15orf52
Chromosome 15 open reading frame 52 is a human protein encoded by the C15orf52 gene. Its function is poorly understood.
Gene
C15orf52 is located on the reverse strand of human (Homo sapiens) chromosome 15 at locus 15q15.1. The gene spans 9,516 base pairs, including introns and exons. It contains 12 distinct introns and 11 exons, and produces 7 different mRNAs, including 6 alternatively spliced variants.
Promoter
The promoter region upstream of the gene contains binding sites for several transcription factors that regulate the expression of the C15orf52 gene.
mRNA
The linear mRNA is 5,344 base pairs long. It contains a short 5' untranslated region of 15 base pairs and a long 3' untranslated region of 3,782 base pairs. Three specific miRNA binding sites were found in the 3' untranslated region, for the miRNAs hsa-miR-147b, hsa-miR-203a-3p.1, and hsa-miR-214-5p.
Protein
General Properties
The human C15orf52 protein is 534 amino acids long, has a molecular weight of 57.325 kDa, and contains a domain of unknown function.
Structure
Primary
Compared with the average composition of human proteins, C15orf52 contains phenylalanine, tyrosine, and asparagine at lower frequencies, and glycine and arginine at higher frequencies. The isoelectric point of the protein is 9.457, so the protein is basic and carries a net positive charge at the normal physiological pH of 7.4.
Secondary
C15orf52 has a coiled-coil domain spanning amino acids 60-97, which contains alpha helices.
Tertiary
The tertiary structure of this protein is not yet known.
Subcellular Localization
No transmembrane sequences were detected in the C15orf52 protein. C15orf52 is predicted to be a soluble, non-cytoplasmic protein that is most likely localized to the nucleus.
Post-Translational Modifications
Phosphorylation of the protein has been experimentally observed at two serine residues, S201 and S392. N-terminal acetylation, C-glycosylation, glycation, leucine-rich nuclear export signals, sumoylation, and PEST motifs were all predicted across orthologs of this protein.
Interacting Proteins
Two proteins, THO complex subunit 1 (THOC1) and THO complex subunit 7 (THOC7), were found to interact with C15orf52 using anti-tag coimmunoprecipitation. THOC1 is a component of the THO subcomplex of the TREX complex, which is thought to couple mRNA transcription, processing, and nuclear export. It is also involved in an apoptotic pathway characterized by activation of caspase-6. THOC7 is part of the same subcomplex and is required for efficient export of polyadenylated RNA.
Ring finger protein 2 (RNF2) and SUZ12 polycomb repressive complex 2 subunit (SUZ12) were also indicated as interacting proteins. RNF2 belongs to the polycomb group of proteins, which are important for transcriptional repression of various genes. It also possesses ubiquitin ligase activity. SUZ12 is also a polycomb group protein and is part of a complex that methylates lysines of histones and is involved in gene repression.
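The reasoning in the Primary structure discussion above (an isoelectric point above 7.4 implies a net positive charge at physiological pH) can be illustrated with a small calculation. The following is a minimal sketch rather than the method used to obtain the reported value of 9.457: the pKa values are an assumed textbook-style set, the function names are hypothetical, and the peptides in the usage note are toy sequences, not the C15orf52 sequence.

```python
# Approximate pKa values for ionizable groups (an assumed, simplified set;
# prediction tools differ in the exact values they use).
PKA_POSITIVE = {"K": 10.5, "R": 12.5, "H": 6.0, "N_TERM": 9.0}
PKA_NEGATIVE = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1, "C_TERM": 2.0}


def net_charge(sequence: str, ph: float) -> float:
    """Estimate the net charge of a peptide at a given pH."""
    seq = sequence.upper()
    charge = 0.0
    # Basic groups: fraction carrying +1 charge = 1 / (1 + 10^(pH - pKa))
    for group, pka in PKA_POSITIVE.items():
        count = 1 if group == "N_TERM" else seq.count(group)
        charge += count / (1.0 + 10 ** (ph - pka))
    # Acidic groups: fraction carrying -1 charge = 1 / (1 + 10^(pKa - pH))
    for group, pka in PKA_NEGATIVE.items():
        count = 1 if group == "C_TERM" else seq.count(group)
        charge -= count / (1.0 + 10 ** (pka - ph))
    return charge


def isoelectric_point(sequence: str, tol: float = 1e-4) -> float:
    """Find the pH at which the estimated net charge is zero, by bisection."""
    low, high = 0.0, 14.0
    while high - low > tol:
        mid = (low + high) / 2.0
        if net_charge(sequence, mid) > 0:
            low = mid   # still positively charged: the pI lies at a higher pH
        else:
            high = mid  # zero or negative charge: the pI lies at a lower pH
    return (low + high) / 2.0
```

With a toy glycine- and arginine-rich peptide such as `"GRKRGGRK"`, `isoelectric_point` returns a value above 7.4 and `net_charge(..., 7.4)` is positive, while an acidic toy peptide such as `"GDEDGGED"` gives the opposite sign. The same relationship underlies describing C15orf52 as basic at physiological pH.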
Homology
Paralogs
There are no known complete paralogs of the C15orf52 protein. However, a domain spanning amino acids 9 to 55 of Coiled Coil Domain Containing Protein 9 (CCDC9) is homologous to C15orf52 and is considered paralogous to it. This CCDC9 domain is found in species ranging from primates to mollusks. It is not found in unicellular organisms or in multicellular organisms more distantly related to humans than mollusks.
Orthologs
Orthologs of the C15orf52 protein were traced back as far as cartilaginous fishes. None were found in unicellular organisms or in multicellular organisms more distantly related to humans than cartilaginous fishes.
| Common name | Genus & Species | Date of Divergence from Humans (million years ago) | Accession number | Sequence length (amino acids) | Sequence identity to Humans | Sequence similarity to Humans |
| Human | Homo sapiens | 0 | NP_997263.2 | 534 | 100% | 100% |
| Brandt's bat | Myotis brandtii | 97.5 | XP_005860303.2 | 564 | 76% | 79% |
| Cattle | Bos taurus | 97.5 | XP_015328613.1 | 577 | 75% | 77% |
| Mouflon | Ovis musimon | 97.5 | XP_014962253.1 | 513 | 69% | 74% |
| House mouse | Mus musculus | 90.5 | NP_001001982.2 | 545 | 63% | 71% |
| Japanese gecko | Gekko japonicus | 320.5 | XP_015282702.1 | 591 | 41% | 55% |
| Zebra finch | Taeniopygia guttata | 320.5 | XP_012429790.1 | 625 | 41% | 57% |
| Carolina anole | Anolis carolinensis | 320.5 | XP_008115041.1 | 496 | 39% | 55% |
| Green sea turtle | Chelonia mydas | 320.5 | XP_007069465.1 | 743 | 39% | 57% |
| Chicken | Gallus gallus | 320.5 | XP_004941352.2 | 637 | 38% | 54% |
| Golden eagle | Aquila chrysaetos canadensis | 320.5 | XP_011595804.1 | 647 | 38% | 53% |
| Western clawed frog | Xenopus tropicalis | 355.7 | XP_004917355.1 | 507 | 37% | 54% |
| Common garter snake | Thamnophis sirtalis | 320.5 | XP_013925154.1 | 586 | 37% | 52% |
| Mexican tetra | Astyanax mexicanus | 429.6 | XP_007230442.1 | 354 | 37% | 52% |
| Spotted gar | Lepisosteus oculatus | 429.6 | XP_015206400.1 | 674 | 37% | 52% |
| Common starling | Sturnus vulgaris | 320.5 | XP_014734365.1 | 646 | 37% | 53% |
| Chinese alligator | Alligator sinensis | 320.5 | XP_014372849.1 | 504 | 37% | 54% |
| Burmese python | Python bivittatus | 320.5 | XP_007429068.1 | 587 | 37% | 53% |
| Zebrafish | Danio rerio | 429.6 | XP_001337385.3 | 516 | 32% | 51% |
| Australian ghostshark | Callorhinchus milii | 482.9 | XP_007891400.1 | 692 | 29% | 45% |
| Pufferfish | Takifugu rubripes | 429.6 | XP_011614636.1 | 525 | 35% | 51% |
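
The identity and similarity columns in the table above are percentages derived from pairwise sequence alignments. The sketch below shows one way such figures could be computed from two already-aligned sequences. It is an illustration under stated assumptions, not the procedure used to produce the table: the similarity groups are a simplified, assumed classification (alignment programs normally use a scoring matrix), the alignment length is used as the denominator, and the function name and example sequences are hypothetical.

```python
from typing import Tuple

# Amino acids treated as "similar" to one another (an assumed, simplified
# grouping; real tools derive similarity from a substitution matrix).
SIMILAR_GROUPS = [
    set("ILVM"), set("FYW"), set("KRH"), set("DE"),
    set("ST"), set("NQ"), set("AG"),
]


def identity_and_similarity(aligned_a: str, aligned_b: str) -> Tuple[float, float]:
    """Return (percent identity, percent similarity) for two aligned sequences.

    Gaps are written as '-'. Percentages are relative to the alignment length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    identical = 0
    similar = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # a gap column counts as neither identical nor similar
        if a == b:
            identical += 1
            similar += 1  # identical residues also count as similar
        elif any(a in group and b in group for group in SIMILAR_GROUPS):
            similar += 1
    length = len(aligned_a)
    return 100.0 * identical / length, 100.0 * similar / length


# Toy alignment (hypothetical sequences, not taken from the table):
print(identity_and_similarity("MKRLSG-E", "MRRLTGAD"))  # (50.0, 87.5)
```

Because identical residues are counted as similar too, similarity is always at least as high as identity, which matches the pattern seen in every row of the table.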