CXorf66
CXorf66, also known as Chromosome X Open Reading Frame 66, is a 361-amino-acid (aa) protein in humans that is encoded by the CXorf66 gene. The encoded protein is predicted to be a type 1 transmembrane protein; however, its exact function is currently unknown.
A patent for CXorf66, US 8586006, is attributed to the Institute for Systems Biology and Integrated Diagnostics, Inc.
CXorf66 protein is a potential novel cancer biomarker.
Gene
CXorf66 is located on the X chromosome at Xq27.1, on the complement strand. According to OMIM, the genes neighboring CXorf66 include SOX3 and CDR1.
mRNA
Splice variants
CXorf66 has only one known splice variant, with three exons and two introns. The exon junctions fall at 30aa and 81aa. Only one polyadenylation site has been found for CXorf66.
Protein
Composition
With 57 serines and 42 lysines, the CXorf66 protein is both serine and lysine rich. CXorf66 has a molecular weight of 39.9 kDa and an isoelectric point of 9.89.
Domains
The CXorf66 protein has a predicted signal peptide at 1-19aa, a topological domain at 20-47aa, a transmembrane domain at 48-68aa, and a second topological domain at 69-361aa. A signal peptide cleavage site is predicted between residues 17 and 18. Based on the protein's composition and post-translational modifications, the first topological domain is predicted to be extracellular, while the second topological domain is predicted to be cytoplasmic. A visual can be seen in Figure II.
Three repeat motifs, DKPV, SEAK, and PKRS, have been found in the human CXorf66 protein. These repeats are conserved in other primates such as Gorilla gorilla gorilla and Macaca mulatta, but are not present in other mammals.
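The domain boundaries quoted in this section can be written down as a small lookup table to check that they tile the full 361-residue protein. The following is a minimal sketch, assuming 1-based inclusive residue numbering; the names `REGIONS`, `region_length`, and `region_of` are hypothetical and chosen only for illustration.

```python
# Predicted regions of the CXorf66 protein, using the boundaries quoted in
# the text. Assumption: residue positions are 1-based and inclusive.
REGIONS = [
    ("signal peptide", 1, 19),
    ("topological domain (predicted extracellular)", 20, 47),
    ("transmembrane domain", 48, 68),
    ("topological domain (predicted cytoplasmic)", 69, 361),
]


def region_length(start, end):
    """Number of residues in the closed interval [start, end]."""
    return end - start + 1


def region_of(position):
    """Return the name of the predicted region containing a residue."""
    for name, start, end in REGIONS:
        if start <= position <= end:
            return name
    raise ValueError(f"position {position} is outside 1-361")


# Length of each predicted region; the lengths should add up to 361.
lengths = {name: region_length(start, end) for name, start, end in REGIONS}
total_length = sum(lengths.values())
```

For instance, `region_of(233)` places residue 233, the position of the natural variant reported in this article, in the second (predicted cytoplasmic) topological domain.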
SNPs
One natural variant is known in the population: at 233aa of the CXorf66 protein, proline is replaced by leucine, with proline being the ancestral encoded amino acid. No effects have been observed for this missense mutation.
Interacting proteins
Based on STRING's predicted protein interactions, CXorf66 has medium-level scores for association with the proteins listed in Figure III. It is important to note that none of the listed interactions has been experimentally determined.
Regulation
Transcription
Promoter
Genomatix predicts only one known promoter for the CXorf66 gene. It lies on the negative strand at 139047554-139048298 and is 745 bp in length. When the BLAT search alignment tool was run on this predicted promoter, numerous hits with high identity were retrieved for various genes on different chromosomes. The following are a few of the top-scoring results that share a high percent identity:
| Name | Score | Span (bp, out of 745) | Identity | Chromosome | Strand | Start | End |
| --- | --- | --- | --- | --- | --- | --- | --- |
| ZBTB8A | 282 | 656 | 88.2% | 1 | - | 32994892 | 32995547 |
| TESK2 | 263 | 624 | 90.3% | 1 | - | 45843093 | 45843716 |
| TBCK | 244 | 639 | 91.5% | 4 | + | 107146630 | 107147268 |
| USP48 | 241 | 631 | 89.0% | 1 | + | 22014725 | 22015355 |
| PTPN22 | 227 | 281 | 90.0% | 1 | - | 114365307 | 114365587 |
| PSPH | 220 | 605 | 90.6% | 7 | - | 56098319 | 56098923 |
Notably, TESK2 encodes a testis-specific protein kinase, which is consistent with the predicted tissue expression of CXorf66.
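The promoter length and the span of each BLAT hit follow from the start and end coordinates. The sketch below recomputes them, assuming the coordinates are 1-based and inclusive (an assumption that matches the lengths reported above); the names `span_length`, `PROMOTER`, and `HITS` are hypothetical and used only for illustration.

```python
def span_length(start, end):
    """Length in bp of a closed interval [start, end].

    Assumption: coordinates are 1-based and inclusive.
    """
    return end - start + 1


# Predicted CXorf66 promoter coordinates, as quoted in the text.
PROMOTER = (139047554, 139048298)

# BLAT hits from the table: gene name -> (start, end, reported span in bp).
HITS = {
    "ZBTB8A": (32994892, 32995547, 656),
    "TESK2": (45843093, 45843716, 624),
    "TBCK": (107146630, 107147268, 639),
    "USP48": (22014725, 22015355, 631),
    "PTPN22": (114365307, 114365587, 281),
    "PSPH": (56098319, 56098923, 605),
}

promoter_length = span_length(*PROMOTER)

# Recomputed span of each hit, and the fraction of the promoter length
# that the span corresponds to.
computed_spans = {
    gene: span_length(start, end) for gene, (start, end, _) in HITS.items()
}
coverage = {gene: span / promoter_length for gene, span in computed_spans.items()}

# Genes whose recomputed span disagrees with the reported span column.
mismatches = [
    gene for gene, (_, _, reported) in HITS.items()
    if computed_spans[gene] != reported
]
```

Under this assumption the recomputed spans agree with the reported span column for every listed hit, so `mismatches` is empty.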