SMC protein
SMC proteins represent a large family of ATPases that participate in many aspects of higher-order chromosome organization and dynamics. SMC proteins are widely conserved across bacteria, archaea, and eukaryotes. In eukaryotes, they function as the core ATPase subunits of large protein complexes such as condensin, cohesin, and SMC5/6.
The term SMC derives from a mutant strain of Saccharomyces cerevisiae named smc1, which was identified based on its defect in maintaining the stability of mini-chromosomes. After the gene product of SMC1 was characterized, and homologous proteins were found to be essential for chromosome structure and dynamics in many organisms, the acronym SMC was redefined to stand for "Structural Maintenance of Chromosomes".
Classification
Eukaryotic SMCs
Eukaryotes have at least six SMC proteins in individual organisms, and they form three distinct heterodimers with specialized functions:- SMC1-SMC3: A pair of SMC1 and SMC3 constitutes the core subunits of the cohesin complexes involved in sister chromatid cohesion.
- SMC2-SMC4: A pair of SMC2 and SMC4 acts as the core of the condensin complexes implicated in chromosome condensation.
- SMC5-SMC6: A pair of SMC5 and SMC6 functions as part of a yet-to-be-named complex implicated in DNA repair and checkpoint responses.
In addition to the six subtypes, some organisms have variants of SMC proteins. For instance, mammals have a meiosis-specific variant of SMC1, known as SMC1β. The nematode Caenorhabditis elegans has an SMC4-variant that has a specialized role in dosage compensation.
The following table shows the SMC proteins names for several model organisms and vertebrates:
| Subfamily | Complex | Vertebrates | D. melanogaster | C. elegans | S. cerevisiae | S. pombe | T. thermophila |
| SMC1α | cohesin | SMC1α | Smc1 | SMC-1 | Smc1 | Psm1 | Smc1 |
| SMC2 | condensin | SMC2/CAP-E | Smc2 | MIX-1 | Smc2 | Cut14 | Smc2 |
| SMC3 | cohesin | SMC3 | Smc3 | SMC-3 | Smc3 | Psm3 | Smc3 |
| SMC4 | condensin | SMC4/CAP-C | Smc4 | SMC-4 | Smc4 | Cut3 | Smc4 |
| SMC5 | SMC5/6 | SMC5 | Smc5 | SMC-5 | Smc5 | Smc5 | - |
| SMC6 | SMC5/6 | SMC6 | Smc6 | SMC-6 | Smc6 | Smc6/Rad18 | - |
| SMC1β | cohesin(meiotic) | SMC1β | - | - | - | - | - |
| SMC4 variant | condensin IDC | - | - | DPY-27 | - | - | - |
Prokaryotic SMCs
The evolutionary origin of SMC proteins is ancient, and homologs are widely conserved in both bacteria and archaea.- SMC : Many bacteria and archaea possess canonical SMC proteins that closely resemble their eukaryotic counterparts. These bacterial and archaeal SMCs form homodimers and associate with regulatory subunits to form condensin-like complexes, SMC-ScpAB. It is hypothesized that the eukaryotic ancestor possessed two types of SMC proteins: a canonical SMC and a non-canonical SMC. Gene duplications of these two ancestral types are thought to have given rise to the six SMC subfamilies present in the last eukaryotic common ancestor : SMC1–4 evolved from the canonical lineage, while SMC5 and SMC6 evolved from the non-canonical lineage.
- MukB: In some γ-proteobacteria, including Escherichia coli, SMC function is carried out by a distantly related protein called MukB. MukB also forms homodimers and, together with regulatory subunits, assembles into a MukBEF complex, which performs condensin-like functions in organizing bacterial chromosomes.
- MksB/JetC/EptC: A third type of prokaryotic SMC protein, known as MksB, has been identified in certain bacterial species. Like MukB, MksB forms a distantly-related condensin-like complex, MksBEF. More recently, a variant complex called MksBEFG, which includes a nuclease subunit MksG, has been shown to function in plasmid defense. In other bacterial lineages, orthologous systems have been identified, including JetABCD and EptABCD. These systems are collectively referred to as the Wadjet family of SMC-like complexes.
SMC-related proteins
- In eukaryotes, Rad50 is a well-known SMC-related protein involved in the repair of DNA double-strand breaks.
- In bacteria, several proteins related to DNA repair also belong to the extended SMC family, including SbcC, RecF, and RecN.
- In archaea, a subfamily known as Archaea-specific SMC-related proteins has been identified. Previously described archaeal proteins such as Sph1/2 and ClsN are now considered members of this ASRP subfamily.
Subunit composition of SMC protein complexes
| Subunit type | cohesin | condensin I | condensin II | SMC5/6 | SMC-ScpAB | MukBEF | JetABCD |
| ν-SMC | SMC3 | SMC2 | SMC2 | SMC5 | SMC | MukB | JetC |
| κ-SMC | SMC1 | SMC4 | SMC4 | SMC6 | SMC | MukB | JetC |
| kleisin | RAD21 | CAP-H | CAP-H2 | Nse4 | ScpA | MukF | JetA |
| HEAT-A | NIPBL/Pds5 | CAP-D2 | CAP-D3 | - | - | - | - |
| HEAT-B | STAG1/2 | CAP-G | CAP-G2 | - | - | - | - |
| kite-A | - | - | - | Nse1 | ScpB | MukE | JetB |
| kite-B | - | - | - | Nse3 | ScpB | MukE | JetB |
| SUMO ligase | - | - | - | Nse2 | - | - | - |
| nuclease | - | - | - | - | - | - | JetD |
All SMC dimers, whether of eukaryotic or prokaryotic origin, associate with a kleisin subunit. In condensins and cohesin, the kleisin subunit is further associated with a pair of HEAT-repeat subunits. Notably, the eukaryotic SMC5/6 complex contains "kite" subunits instead of HEAT-repeat subunits, making it structurally more similar to prokaryotic complexes such as SMC–ScpAB, MukBEF, and MksBEF. However, unlike their typically homodimeric prokaryotic counterparts, both the SMC and kite subunits in the SMC5/6 complex are heterodimeric, resulting in a more elaborate subunit architecture. The SMC5/6 complex and the Wadjet complex each possess an additional catalytic subunit: the SUMO ligase Nse2 in SMC5/6, and the nuclease JetD in JetABCD.
Molecular structure
Primary structure
SMC proteins are 1,000-1,500 amino-acid long. They have a modular structure that is composed of the following domains:- Walker A ATP-binding motif
- coiled-coil region I
- hinge region
- coiled-coil region II
- Walker B ATP-binding motif; signature motif
Secondary and tertiary structure