CLIP4


CAP-Gly Domain Containing Linker Protein Family Member 4 is a protein that in humans is encoded by the CLIP4 gene. In terms of conserved domains, the CLIP4 gene contains primarily ankyrin repeats and the eponymous CAP-Gly domains. The structure of the CLIP4 protein is largely made up of coil, with alpha helices dominating the rest of the protein. CLIP4 mRNA expression occurs largely in the adrenal cortex and atrioventricular node. The literature encompassing CLIP4's conserved domains and paralogs points toward microtubule regulation as a possible function of CLIP4.

Gene

The human CLIP4 gene, also known as Restin-Like Protein 2, is located on the plus strand of the short arm of chromosome 2 at region 2, band 3 from base pair 29,096,676 to base pair 29,189,643. CLIP4 is 92,968 base pairs in length and consists of 23 exons.

Protein

The human CLIP4 protein is 705 amino acids in length and is composed of two main types of conserved domains: Two CAP-Gly domains and numerous ankyrin repeats. The secondary structure of CLIP4 consists largely of random coil, with alpha helices as the second-most abundant structure and beta sheets as the third-most abundant structure.
The isoelectronic point of the unprocessed CLIP4 protein is slightly basic, meaning there is a slight excess of basic amino acids compared to acidic amino acids. The molecular weight is about 65 kD. The most abundant amino acid in CLIP4 is Serine, which makes up 10.7% of the protein. Aligned matching blocks of separated, tandem, and periodic repeats are found between positions 340-345 and 542-547, as well as 447-547 and 564-568. The unusual 9-figure periodic element of a singular Lysine followed by eight other amino acids occurs five times within the protein when compared to the swp23s.q dataset. Another unusual phenomenon is a 7-figure periodic element of a negatively charged amino acid followed by six other hydrophobic amino acids, which occurs six times within the protein when compared to the swp23s.q dataset. There are two instances of Serine spacing and two instances of Phenylalanine spacing that comprise unusually large distances when compared to the swp23s.q dataset.

Expression

CLIP4 RNA expression is consistently measured to a high degree in the thyroid. Additionally, high degrees of transcription occur in the adrenal cortex and atrioventricular node. The Human Protein Atlas points toward high RNA expression values in the muscle tissues, as well as some in the skin, endocrine tissues, and proximal digestive tract. Greatest protein expression values appeared in the muscle tissues as well, in addition to some in the lung, gastrointestinal tract, liver & gallbladder, and bone marrow & lymphoid tissues.
CLIP4 protein expression seems to be highly expressed during Ada3 deficiency. There also exists a higher trend towards higher CLIP4 expression in the absence of U28.

Regulation

Gene

Common transcription factor binding sites

These transcription factors were chosen and organized based on proximity to the promoter and matrix similarity.
Transcription factorDetailed Matrix InfoAnchor baseMatrix similaritySequence
NOLFEarly B-cell factor 1

17
0.98taagagTCCCcagggcagaaaca

PAX2Zebrafish PAX2 paired domain protein

180.8aagagtccccagggcagAAACaa

AP2FTranscription factor AP-2, alpha

160.98ctgcCCTGgggactc

AP2FTranscription factor AP-2, beta

160.899gagTCCCcagggcag

SORYSRY box 9, dimeric binding sites

350.768aAACAaaatccagtgagggagag

HNF6CUT-homeodomain transcription factor Onecut-2

320.827aaacaaAATCcagtgag

PAX5B-cell-specific activator protein

400.815acaaaaTCCAgtgagggagagatgcaggg

ZF16PR/SET domain 15

360.852aaatccagtgaGGGA

SORYHMGI high-mobility-group protein I, architectural transcription factor organizing the framework of a nuclear protein-DNA transcriptional complex

780.945tggaAATTttctaccttaggagc

NFATNuclear factor of activated T-cells 5

830.955ttttGGAAattttctacct

NFATNuclear factor of activated T-cells 5

830.871aggtAGAAaatttccaaaa

CEBPCCAAT/enhancer binding protein, epsilon

890.975agccttttGGAAatt

CAATCellular and viral CCAAT box

1100.91gcagCCATttaatct

CAATAvian C-type LTR CCAAT box            

1650.875cccaCCAAgcagtgg

CEBPCCAAT/enhancer binding protein, gamma

6500.866ctaaTTGCtcaacgt

CEBPCCAAT/enhancer binding protein alpha

6510.971cacgttgaGCAAtta

VTBPMammalian C-type LTR TATA box

6800.903tgctgTAAAaggcctaa

TF2BTranscription factor II B recognition element

9831ccgCGCC

TF2BTranscription factor II B recognition element

11571ccgCGCC

TF2BTranscription factor II B recognition element

12281ccgCGCC


Transcriptional

The human CLIP4 mRNA sequence has 12 stem-loop structures in its 5' UTR and 13 stem-loop structures in its 3' UTR. Of those secondary structures, there are 12 conserved stem-loop secondary structures in the 5'UTR as well as 1 conserved stem-loop secondary structure in the 3' UTR.

Protein

The human CLIP4 protein is localized within the cellular nuclear membrane. CLIP4 does not have a signal peptide due to its intracellular localization. It also does not have N-linked glycosylation sites for that same reason. CLIP4 is not cleaved. However, numerous O-linked glycosylation sites are present. A high density of phosphorylation sites are present in the 400-599 amino acid positions on the CLIP4 protein, although many are also present throughout the rest of the protein.

Function

CAP-Gly domains are often associated with microtubule regulation. In addition, ankyrin repeats are known to mediate protein-protein interactions. Furthermore, CLIP1, a paralog of CLIP4 in humans, is known to bind to microtubules and regulate the microtubule cytoskeleton. The CLIP4 protein is also predicted to interact with various microtubule-associated proteins. As a result, it is likely that the CLIP4 protein, although uncharacterized, is associated with microtubule regulation.

Interacting Proteins

The CLIP4 protein is predicted to interact with many proteins associated with microtubules; namely, MAPRE1, MAPRE2, and MAPRE3. It is also predicted to interact with CKAP5 and DCTN1, a cytoskeleton-associated protein and dynactin-associated protein respectively.

Clinical significance

Importance in various cancers

CLIP4 activity is correlated with the spread of renal cell carcinomas within the host and could therefore be a potential biomarker for RCC metastasis in cancer patients. Additionally, measurement of promotor methylation levels of CLIP4 using a Global Methylation DNA Index reveals that higher methylation of CLIP4 is associated with an increase in severity of gastritis to possibly gastric cancer. This indicates that CLIP4 could be used for early detection of gastric cancer. A similar finding was also documented for prostate cancer, in which CLIP4 was found to be hypermethylated in patients with prostate cancer.

Importance in other diseases

The presence of CLIP4 was found to be highly increased in samples with predicted severe fibrosis as a result of Chronic Hepatitis C virus. Additionally, the presence of CLIP4 as a novel self-antigen in Systemic Lupus Arythematosus points to it having a potential role in the disease mechanism.

Homology

CLIP4 orthologs

These orthologs were chosen and organized based on estimated date of divergence from the human protein as well as the global sequence identity.
Binomial NomenclatureCommon NameTaxonomic GroupEstimated DoD from Human Accession NumberSequence Length Global Sequence Identity to Human Protein Global Sequence Similarity to Human Protein
Homo sapiens HumanPrimate0AAP97312601100100
Aotus nancymaae Ma's night monkeyPrimate43.2XP_01233089570483.583.7
Sorex araneus Common shrewEulipotyphla96XP_0046200567077478.5
Antrostomus carolinensis Chuck-will's-widowAves312XP_02894299770266.575.4
Gekko japonicus Schlegel's Japanese geckoReptilia312XP_01527036670263.873.1
Rhinatrema bivittatum Two-lined caecilianAmphibians351.8XP_02944886270759.570.5
Callorhinchus milii Elephant sharkChondrichthyes473XP_00789501671552.565.6
Branchiostoma floridae Florida lanceletLeptocardii684XP_00260682448140.452.8
Saccoglossus kowalevskii Acorn wormEnteropneusta684XP_00682268664835.747.5
Ixodes scapularis Black-legged tickArachnid797XP_02983109052738.953
Limulus polyphemus Atlantic horseshoe crabArachnid797XP_0137863764623851.6
Lottia gigantea Owl limpet  Gastropods797XP_00904684366936.349.3
Mizuhopecten yessoensis Yesso scallopBivalvia797XP_02135974763335.447.2
Parasteatoda tepidariorum Common house spiderArachnid797XP_01591496661634.747.6
Aplysia californica California sea hareGastropods797XP_01294534665333.745.7
Crassostrea virginica Eastern oysterBivalvia797XP_02231587964632.745.1
Tetranychus urticae Two-spotted spider miteArachnid797XP_01579053665231.943.5
Centruroides sculpturatus Bark scorpionArachnid797XP_02322948460530.643.4
Penaeus vannamei Pacific white shrimpMalacostracans797XP_02720674668122.934
Monosiga brevicollis ChoanoflagellateChoanoflagellatea1023XP_00174858057625.340.8