C8orf58


Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.

Gene

The C8orf58 gene is located on chromosome 8 at position 8p21.3. It spans a total of 4,550 base pairs and has seven exons. C8orf58 is flanked by the genes PDLIM2 and CCAR2. It has no known aliases and is annotated as a protein-coding gene.

mRNA

C8orf58 produces three transcript splice variants. Variant 1 is the longest transcript and encodes the largest protein; it is 2,062 base pairs long and contains seven exons. The other two variants arise from alternative splice sites.
Isoform | Exons | Length (bp) | Features
Transcript Variant 1 | 1, 2, 3, 4, 5, 6, 7 | 2062 | One upstream in-frame stop codon.
Transcript Variant 2 | 1, 2, 3, 4, 5, 6, 7 | 2038 | Alternate in-frame splice site in the 3' coding region.
Transcript Variant 3 | 1, 2, 3, 4, 5, 6 | 1955 | Lacks an alternate exon, results in a frameshift in the 3' coding region.

C8orf58 has a relatively short 5’ region and a moderate 3’ region. Both the 5’ and 3’ regions contain stem loops. There is one predicted miRNA binding site in the 3’ UTR of C8orf58.

Protein

C8orf58 protein Isoform 1 is 365 amino acids long. Isoform 2 and Isoform 3 are 357 and 300 amino acids long, respectively. A Kozak consensus sequence is present, which is consistent with a protein-coding sequence.
C8orf58 Isoform 1 has a molecular weight of 39.7 kDa and an isoelectric point of 8.29. It is rich in proline and arginine and poor in isoleucine, asparagine, phenylalanine, and tyrosine.
The predicted secondary structure of the C8orf58 protein includes multiple alpha helices and one beta strand.
Isoform | From mRNA Variant | Length (aa) | Molecular Weight (kDa) | Isoelectric Point
1 | 1 | 365 | 39.7 | 8.30
2 | 2 | 357 | 38.6 | 8.30
3 | 3 | 300 | 32.0 | 5.82
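
The transcript and protein lengths quoted above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes each coding sequence ends in a single stop codon and uses a rule-of-thumb average residue mass of about 110 Da; neither assumption comes from the source, and the function names are hypothetical.

```python
# Lengths and masses as stated in the article.
ISOFORMS = {
    1: {"protein_aa": 365, "transcript_bp": 2062, "reported_kda": 39.7},
    2: {"protein_aa": 357, "transcript_bp": 2038, "reported_kda": 38.6},
    3: {"protein_aa": 300, "transcript_bp": 1955, "reported_kda": 32.0},
}

# Assumption: rule-of-thumb average mass of one amino acid residue, in daltons.
AVG_RESIDUE_DA = 110.0


def coding_length_bp(protein_aa):
    """Coding sequence length: three bases per residue plus one stop codon."""
    return 3 * (protein_aa + 1)


def untranslated_bp(transcript_bp, protein_aa):
    """Bases of the transcript left over for the 5' and 3' regions combined."""
    return transcript_bp - coding_length_bp(protein_aa)


def approx_mass_kda(protein_aa):
    """Rough molecular weight estimate from length alone."""
    return protein_aa * AVG_RESIDUE_DA / 1000.0


if __name__ == "__main__":
    for number, iso in ISOFORMS.items():
        cds = coding_length_bp(iso["protein_aa"])
        utr = untranslated_bp(iso["transcript_bp"], iso["protein_aa"])
        est = approx_mass_kda(iso["protein_aa"])
        print(f"Isoform {number}: CDS {cds} bp, non-coding {utr} bp, "
              f"estimated {est:.1f} kDa (reported {iso['reported_kda']} kDa)")
```

Under these assumptions, variants 1 and 2 each leave 964 bases outside the coding sequence and variant 3 leaves 1,052. The length-based mass estimates fall within about 1 kDa of the reported values.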

Evolutionary history

C8orf58 is part of the DUF4657 family, a family of proteins found in eukaryotes. Proteins in this family are typically between 305 and 370 amino acids in length. The Domain of Unknown Function of C8orf58 spans amino acids 73 to 364.
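
As a small worked check of these figures, the sketch below computes how much of the protein the domain covers and which isoform lengths fall inside the family's typical range. It assumes the domain coordinates are inclusive and refer to the 365-residue Isoform 1, which the source does not state explicitly; the helper names are hypothetical.

```python
# Values stated in the article.
ISOFORM_LENGTHS_AA = {1: 365, 2: 357, 3: 300}
DOMAIN_START, DOMAIN_END = 73, 364   # assumed inclusive, on Isoform 1
FAMILY_TYPICAL_RANGE = (305, 370)    # typical DUF4657 protein length, in residues


def domain_length(start, end):
    """Number of residues in an inclusive start..end span."""
    return end - start + 1


def domain_coverage(start, end, protein_length):
    """Fraction of the protein covered by the domain."""
    return domain_length(start, end) / protein_length


def in_typical_range(length, bounds=FAMILY_TYPICAL_RANGE):
    """True if a protein length lies within the family's typical range."""
    low, high = bounds
    return low <= length <= high


if __name__ == "__main__":
    span = domain_length(DOMAIN_START, DOMAIN_END)
    share = domain_coverage(DOMAIN_START, DOMAIN_END, ISOFORM_LENGTHS_AA[1])
    print(f"Domain spans {span} residues ({share:.0%} of Isoform 1)")
    for number, length in ISOFORM_LENGTHS_AA.items():
        print(f"Isoform {number} ({length} aa) in typical range: "
              f"{in_typical_range(length)}")
```

With these inputs the domain covers 292 residues, about 80% of Isoform 1. Isoforms 1 and 2 fall within the typical family range, while the 300-residue Isoform 3 falls just below it.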

Expression

According to the NCBI GEO profiles, C8orf58 is a narrowly expressed protein found in spleen, lung, thymus, prostate, and spinal cord tissue. It is constitutively expressed in these tissues.

Post-translational modification

The bioinformatic tools on Expasy were used to determine potential post-translational modification sites for the C8orf58 protein. There are two predicted phosphorylation sites and one predicted sumoylation site.

Subcellular localization

According to PSORT II, C8orf58 is predicted to be located in the nucleus. This is consistent with the presence of a predicted sumoylation site, since sumoylation is involved in nucleocytoplasmic transport.

Interacting proteins

Two proteins have been found to interact with C8orf58: CENPH and metG1, identified by a two-hybrid assay and a two-hybrid pooling approach, respectively. CENPH plays a critical role in centromere structure, kinetochore formation, and sister chromatid separation. MetG1 is required for elongation in protein synthesis and for the initiation of all mRNA translation through initiator tRNA aminoacylation.

Homology

An important paralog of this gene is ENSG00000248235. Orthologs of the human gene C8orf58 are limited to vertebrates.
Scientific name | Common name | NCBI Accession Number | Length (aa) | Date of Divergence (MYA) | Identity (%) | Similarity (%)
Homo sapiens | Human | NP_001013864.1 | 365 | - | - | -
Gorilla gorilla | Gorilla | XP_004046807.1 | 439 | 9.06 | 96 | 79.50
Marmota marmota | Alpine Marmot | XP_015354979.1 | 369 | 90 | 68 | 75.7
Oryctolagus cuniculus | European Rabbit | XP_008248092.1 | 371 | 90 | 66 | 72
Nannospalax galili | Spalax | XP_008848689.1 | 362 | 90 | 65 | 74.7
Ceratotherium simum simum | White Rhinoceros | XP_014652157.1 | 381 | 96 | 66 | 72.7
Odobenus rosmarus divergens | Pacific walrus | XP_012418498.1 | 388 | 96 | 65 | 74.7
Sus scrofa | Wild Boar | XP_005670472.1 | 382 | 96 | 65 | 73.3
Hipposideros armiger | Great Roundleaf Bat | XP_019487131.1 | 387 | 96 | 62 | 71
Eptesicus fuscus | Big Brown Bat | XP_008149784.1 | 377 | 96 | 62 | 70.1
Loxodonta africana | African Bush Elephant | XP_003412428.1 | 372 | 105 | 71 | 77.2
Orycteropus afer afer | Aardvark | XP_007949039.1 | 370 | 105 | 65 | 71.7
Parus major | Great Tit | XP_015504136.1 | 320 | 312 | 32 | 35.6
Anolis carolinensis | Carolina Anole | XP_008118367.1 | 453 | 312 | 28 | 38.9
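
To make the trend in the ortholog table easier to see, the sketch below groups the orthologs by date of divergence and averages their sequence identity to the human protein. It is a minimal illustration using only the values listed above. It assumes the divergence dates are in millions of years and the identities are percentages; the function name is hypothetical.

```python
from collections import defaultdict
from statistics import mean

# (common name, date of divergence, identity to human) as listed in the table.
# Assumption: divergence is in millions of years, identity in percent.
ORTHOLOGS = [
    ("Gorilla", 9.06, 96),
    ("Alpine Marmot", 90, 68),
    ("European Rabbit", 90, 66),
    ("Spalax", 90, 65),
    ("White Rhinoceros", 96, 66),
    ("Pacific walrus", 96, 65),
    ("Wild Boar", 96, 65),
    ("Great Roundleaf Bat", 96, 62),
    ("Big Brown Bat", 96, 62),
    ("African Bush Elephant", 105, 71),
    ("Aardvark", 105, 65),
    ("Great Tit", 312, 32),
    ("Carolina Anole", 312, 28),
]


def mean_identity_by_divergence(rows):
    """Map each divergence date to the mean identity of its orthologs."""
    groups = defaultdict(list)
    for _name, divergence, identity in rows:
        groups[divergence].append(identity)
    # Sort by divergence date so the result reads from recent to ancient.
    return {date: mean(values) for date, values in sorted(groups.items())}


if __name__ == "__main__":
    for date, identity in mean_identity_by_divergence(ORTHOLOGS).items():
        print(f"{date:>6} -> mean identity {identity:.1f}")
```

Grouped this way, the mammalian orthologs that diverged 90 to 105 units ago average roughly 64 to 68 identity, while the bird and reptile orthologs at 312 average 30.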