TRAPPC14


TRAPPC14 also known as MAP11 is a protein that in human is encoded by the gene TRAPPC14. It was previously referred to by the generic name C7orf43. C7orf43 has no other human alias, but in mice can be found as BC037034.

Gene Locus

In humans, MAP11 is located in the long arm of human chromosome 7, and is on the negative strand. Genes located around C7orf43 include GAL3ST4, LAMTOR4, GPC2. In humans, C7orf43 has 9 detected common single-nucleotide polymorphisms, all of which are located in non-coding regions and thus do not affect amino acid sequence.

mRNA

Splice variants

MAP11 encodes 2 isoforms, the longest being C7orf43 isoform 1, which is 2585 base pairs long and has with 11 exons and 10 introns. C7orf43 isoform 1 encodes a protein that is 580 amino acids long and only has one polyadenylation site. C7orf43 isoform 2 is 2085 base pairs long and encodes a protein of 311 amino acids. Two additional isoforms has been reported on several occasions, encoding for proteins with 199 and 206 amino acids.

Tissue expression

MAP11 has a widespread moderate expression with tissue to tissue variability in humans and across mammalian species. The mouse C7orf43 ortholog has been shown to be ubiquitously expressed in the brain, as well as in the mouse embryonic central nervous system.

Regulations

MAP11 has one promoter region upstream of its transcription site, as predicted by Genomatix. This promoter is 657 base pairs long and is located at position 99756182 to 99756838 in the negative strand of chromosome 7. There are several transcription factor binding sites located in this promoter, including binding sites for zinc fingers and Kruppel-like transcription factors. The top 20 transcription binding sites as predicted by the ElDorado from Genomatix is listed in the following table.
Detailed Family InformationDetailed Matrix InformationStart PositionEnd PositionAnchor PositionStrandMatrix Similarity ScoreSequence
Brachyury gene, mesoderm developmental factorT-box transcription factor TBX20617645631+1
C2H2 zinc finger transcription factors 2 KRAB-containing zinc finger protein 300596 618 607 + 1
Fork head domain factorsAlternative splicing variant of FOXP1, activated in ESCs 37 53 45 - 1
Pleomorphic adenoma genePleomorphic adenoma gene 1 411 433 422 - 1
Pleomorphic adenoma genePleomorphic adenoma gene 1 464 486 475 - 1
RNA polymerase II transcription factor II B Transcription factor II B recognition element 197 203 200 + 1
TGF-beta induced apoptosis proteinsCysteine-serine-rich nuclear protein 1 73 79 76 - 1
GC-Box factors SP1/GCStimulating protein 1, ubiquitous zinc finger transcription factor 418 434426-0.998
Human and murine ETS1 factorsEts variant 3486 506 496 - 0.996
Krueppel like transcription factorsGut-enriched Krueppel-like factor / KLF4469 485 477 - 0.994
Two-handed zinc finger homeodomain transcription factors AREB6 495 507 501 +0.994
Zinc finger transcription factor RU49, zinc finger proliferation 1 - Zipro1Zinc finger transcription factor RU49. RU49 exhibits a strong preference for binding to tandem repeats of the minimal RU49 consensus binding site.522528525 +0.994
Krueppel like transcription factorsCore promoter-binding protein with 3 Krueppel-type zinc fingers 418434 426-0.992
C2H2 zinc finger transcription factors 7 Zinc finger protein 263, ZKSCAN12 425 439432 + 0.99
C2H2 zinc finger transcription factors 6Zinc finger and BTB domain containing 7, Proto-oncogene FBI-1, Pokémon 252264258 - 0.989
Krueppel like transcription factorsKruppel-like factor 7 416 432 424 - 0.989
GC-Box factors SP1/GCSp4 transcription factor471 487 479 -0.986
Krueppel like transcription factorsGut-enriched Krueppel-like factor137153 145 + 0.986
Krueppel like transcription factorsKrueppel-like factor 2 641657649-0.986
Human and murine ETS1 factorsEts variant 16 26 16 -0.984

Protein

Composition and Domains

The human protein MAP11 has an isoelectric point of 8.94. MAP11 also has a glycine-rich region spanning amino acids 54 through 134. Analysis using the SAPS tool from the SDSC Biology Workbench showed this glycine-rich region to not be conserved in terms of specific glycine residue positions, but is well conserved in overall glycine content in mammals and reptiles, although not in bony fishes. C7orf43 is mostly uncharged, and this neutral charge distribution is conserved in mammals and reptiles, but bony fishes have at least one negative charge cluster
C7orf43 is predicted to have no signal peptide in its first 70 amino acid residues. However, it is predicted to have a vacuolar targeting motif starting at residue 258 in the human protein. This vacuolar targeting motif is shown to be conserved throughout mammals, reptiles, birds, amphibians, and bony fishes.

Evolutionary history

The MAP11 protein has no paralogs in humans. However, C7orf43 orthologs can be found to be highly conserved in mammals, reptiles, and several species of bony fishes. C7orf43 is also conserved in birds, although several bird species lack parts of the N-terminus. No C7orf43 orthologs can be found outside the animal kingdom. The following table lists representative C7orf43 orthologs across multiple animal classes.

Strict orthologs

No.SpeciesCommon nameDate of Divergence Accession No.E-valueLength Identity Similarity
1Homo sapiensHuman-NP_060745.3 0.0 580 100 100
2Pan troglodytesCommon Chimpanzee 6.3XP_009452032 0.0 580 99100
3Macaca mulattaMacaque29.0 XP_001102238 0.0 580 99 99
4Cavia porcellus Guinea pig 92.3XP_003470051 0.0 580 98 98
5Sus scrofa Wild boar 94.2 XP_003124386 0.0 580 98 99
6Odobenus rosmarus divergens Walrus94.2 XP_004399075 0.0 580 98 98
7Tursiops truncatesCommon bottlenose dolphin 94.2 XP_004315199 0.0 582 92 93
8Echinops telfairi Lesser hedgehog tenrec 98.7XP_004705644 0.0 58195 97
9Dasypus novemcinctus Nine-banded armadillo 104.2 XP_0044572340.0580 9798
10Monodelphis domestica Gray short-tailed opossum 162.6 XP_001367097 0.0 568 8992
11Chrysemys picta bellii Painted turtle 296.0 XP_008175974 0.0 572 7683
12Alligator mississippiensis American alligator 296.0 XP_006266384 0.0 582 75 82
13Pelodiscus sinensis Chinese softshell turtle 296.0 XP_006127325 0.056973 81
14Xenopus tropicalis Western clawed frog 371.2 NP_001121523 0.0 580 6474
15Oncorhynchus mykiss Rainbow trout 400.1CDQ848780.0 581 64 75
16Danio rerio Zebrafish 400.1 XP_001339329 0.0 5956374
17Oryzias latipes Japanese rice fish 400.1 XP_004076807 0.06096270
18Takifugu rubripes Pufferfish 400.1XP_003970822 0.0 6186171

Distant orthologs

No.SpeciesCommon nameDate of Divergence Accession No.E-valueLength Identity Similarity
1Nipponia Nippon Crested ibis 296.0 XP_0094723390.0503 8088
2Charadrius vociferousKilldeer 296.0 XP_009892747 0.0456 82 90
3Pseudopodoces humilisGround tit 296.0XP_0055334260.0 600 66 76
4Latimeria chalumnae West Indian Ocean coelacanth 414.9 XP_0060116123E-177 429 65 75
5Branchiostoma floridae Florida lancelet 713.2 XP_002592972 9E-67557 32 46
6Strongylocentrotus purpuratus Purple sea urchin 742.9 XP_003727419 3E-46 725 35 51
7Aplysia californicaCalifornia sea slug 782.7 XP_005113015 4E-21 692 25 39
8Nematostella vectensis Starlet sea anemone 855.3 XP_001632706 4E-19 494 2439
9Trichoplax adhaerens -- XP_0021088095E-15 645 24 41

Post-translational modifications

C7orf43 has three phosphorylated sites, Ser 517, Thr 541 and, Ser 546. All three sites are relatively well-conserved throughout mammals, reptiles, birds, amphibians, and bony fishes. The protein has no predicted N-myristoylation, as it has no N-terminal glycine. However, C7orf43 is predicted to have one N-acetylation on a serine residue at the N-terminus.

Secondary structure

The secondary structure of C7orf43 is yet to be determined. However, C7orf43 is predicted to have no transmembrane domain and to eventually be secreted from the cell. An analysis using the PELE tool from SDSC Biology Workbench predicted mostly beta sheets and random coils that are conserved throughout the strict orthologs. Similarly conserved alpha helix motifs have been predicted, one near the N-terminus and one near the C-terminus.

Clinical significance

While no studies have focused on the characterization of C7orf43, several large-scale screenings have revealed information related to C7orf43 function. A study using FLAG affinity purification mass spectrometry to profile protein interactions in the Hippo signaling pathway identified C7orf43 as one of the interacting proteins. C7orf43 was found to interact with angiomotin-like protein 2 (AMOTL2), also known as Leman Coiled-Coil Protein, a regulator of Hippo signaling. AMOTL2 is also known to be an inhibitor of Wnt signaling, a pathway with known associations to cancer development, and to be a factor for angiogenesis, a process essential to tumour maintenance and metastasis.
Several studies have linked C7orf43 to carcinomic events. Other studies have also linked C7orf43 to carcinomic events. A large-scale yeast two-hybrid experiment identified C7orf43 to be interacting with transmembrane protein 50A (TMEM50A), also known as cervical cancer gene 9 or small membrane protein 1. While the exact function of TMEM50A is unknown, it has been associated with cervical cancer.
C7orf43 has also been identified as a target gene of the transcription factor AP-2 gamma (TFAP2C). TFAP2C has been shown to be involved in the development, differentiation, and oncogenesis of mammary tissues. Specifically, TFAP2C has a role in breast carcinoma through its regulatory effect to ESR1 and ERBB2, both of which are receptors whose aberrations have been associated with breast carcinomas. TFAP2C has also been shown to have an oncogenic role by promotion of cell proliferation and tumour growth in neuroblastoma.
Through its location in the q arm of chromosome 7, C7orf43 has been linked to various diseases. Several diseases have been described as having deletions in the q arm of chromosome 7, among them are myeloid disorders, including acute myelogenous leukemia and myelodysplasia.