Cohesin
Cohesin is a protein complex that mediates sister chromatid cohesion, homologous recombination, and DNA looping. Cohesin is formed of SMC3, SMC1, SCC1 and SCC3. Cohesin holds sister chromatids together after DNA replication until anaphase when removal of cohesin leads to separation of sister chromatids. The complex forms a ring-like structure and it is believed that sister chromatids are held together by entrapment inside the cohesin ring. Cohesin is a member of the SMC family of protein complexes which includes Condensin, MukBEF and SMC-ScpAB.
Cohesin was separately discovered in budding yeast both by Douglas Koshland and Kim Nasmyth in 1997.
Structure and subunits
Cohesin is a multi-subunit protein complex, made up of SMC1, SMC3, RAD21 and SCC3. SMC1 and SMC3 are members of the Structural Maintenance of Chromosomes family. SMC proteins have two main structural characteristics: an ATP-binding cassette-like 'head' domain with ATPase activity and a hinge domain that allows dimerization of SMCs. The head and the hinge domains are connected to each other via long anti-parallel coiled coils. The dimer is present in a V-shaped form, connected by the hinges.The N-terminal domain of RAD21 contains two α-helices which forms a three helix bundle with the coiled coil of SMC3. The central region of RAD21 is thought to be largely unstructured but contains several binding sites for regulators of cohesin. This includes a binding site for SA1 or SA2, recognition motifs for separase cleavage and a region that is competitively bound by PDS5A, PDS5B or NIPBL. The C-terminal domain of RAD21 forms a winged helix that binds two β-sheets in the Smc1 head domain.
Once RAD21 binds the SMC proteins, SCC3 can also associate with RAD21. When RAD21 binds on both SMC1 and SMC3, the cohesin complex forms a closed ring structure. The interfaces between the SMC subunits and RAD21 can open to allow DNA to pass in and out of the cohesin ring.
A structure of the entire cohesin complex has been solved using cryo-electron microscopy. Key findings from the structural studies include:
- Ring Architecture. Cohesin forms a tripartite ring where the SMC proteins dimerize via their hinge domains and are bridged by RAD21.
- ATPase Domain. The head domains of SMC proteins contain ATPase sites that drive cohesin's dynamic interactions with DNA, mostly loop extrusion.
- Conformational Flexibility. Cohesin is a highly dynamic protein, which exist in the open and closed conformations, interchanging by bending in the so-called elbows.
- Regulatory Interactions. Accessory proteins like WAPL and PDS5 have been visualized interacting with the cohesin complex, elucidating their roles in unloading and stabilizing cohesin on chromatin.
Function
Role in chromatin compaction and global organization
- DNA compaction. Cohesin helps to compact DNA by creating the loops of DNA in an ATP-dependent manner via the process called loop extrusion.
- Segregation of chromosomes. Cohesin's activity promotes segregation of chromosomes into chromosomal territories.
Role in cell division
- During mitosis:
- # Maintains sister chromatid cohesion. Cohesin keeps the sister chromatids connected during metaphase ensuring that each sister chromatid segregates to opposite poles during cell division. Without cohesin, the cell would be unable to control sister chromatid segregation since there would be no way of ensuring whether the spindle fiber—attached on each sister chromatid—is from a different pole. Other proteins modulate cohesin function by regulating this process, such as PDS5A, PDS5B, NIPBL and ESCO1 in mammalian cells.
- # Helps to assemble bipolar spindle apparatus. Cohesin ensures the attachment of spindle microtubules and sister kinetochores onto the chromosomes. This is tightly related to the correct sister chromatid segregation towards the two spindle poles. Dysregulation of this process leads to premature chromosomes separation and multipolar spindle-formation. The proteins Shugoshin 1, Rae1 and NuMA are associated with cohesin in this assembly process.
- During meiosis, cohesin recruits additional meiotic-specific component Rec8, that is an essential player in these three processes:
- # Sister chromatids cohesion.
- # Homologous chromosomes pairing.
- # Recombination during meiosis.
Role in regulation of gene expression
Cohesin might play an important role in regulation of gene expression through the following mechanisms:- Cohesin mediating enhancer-promoter interactions by bridging them in cis.
- Cohesin connecting CTCF sites in cis by interacting with CTCFs in a highly specific and oriented manner.
- Cohesin creating regulatory TADs which are the environments for promoter-enhancer interactions.
Other functions
- Cohesin has been found to be essential for embryo development. Cohesin knockout is lethal for development of mouse and zebrafish.
- Cohesin deletion in mature macrophages leads to impairment of inflammatory response of the innate immune system and restricts transcriptional response of primary macrophages to microbial signals.
Localization on DNA
- Accumulation at CTCF sites: This happens due to direct interaction of cohesin subunits SA2 and SCC1 with CTCF. Briefly, in the process of loop extrusion, cohesin moves actively along the two DNA double helices, translocating one of them with respect to the other. Thus, the loop can become smaller or larger. The loop extrusion process stops when cohesin encounters the architectural chromatin protein CTCF. The CTCF site needs to be in a proper orientation to stop cohesin.
- Accumulation at promoters: Two hypotheses were proposed to explain accumulation of cohesin at the gene promoters:
- Cohesin location is influenced by the orientation of neighboring genes and it is most frequently located in areas of convergent transcription. Gene orientation depends on the direction of transcription and can be of three types: head-to-head, head-to-tail and tail-to-tail. The tail-to-tail configuration results in the convergence of transcription machinery. One hypothesis states that the RNA polymerase "pushes" cohesin along the DNA, causing them to move towards the direction of the RNA polymerases. Changing the transcription pattern of genes changes the location of cohesin indicating that localization of cohesin may depend on transcription.
- In another model, chromatin loop extrusion is pushed by transcription generated supercoiling ensuring also that cohesin relocalizes quickly and loops grow with reasonable speed and in a good direction. In addition, the supercoiling-driven loop extrusion mechanism is consistent with earlier explanations proposing why topologically associating domains flanked by convergent CTCF binding sites form more stable chromatin loops than TADs flanked by divergent CTCF binding sites. In this model, the supercoiling also stimulates enhancer promoter contacts and it is proposed that transcription of eRNA sends the first wave of supercoiling that can activate mRNA transcription in a given TAD.
- Accumulation at AT-rich sequences: Cohesins can be frequently found in chromosome arms that have AT-rich DNA sequences indicating that DNA sequence may be an independent factor of cohesin binding.
- Accumulation at centromeres: Cohesin rings, especially in budding yeast, are also located in the region surrounding the centromere. Two hypotheses may explain this: the presence of repetitive heterochromatic DNA in centromeres and the presence of chromosome-associated proteins. For example, Schizosaccharomyces pombe have multiple copies of specific heterochromatic DNA whose involvement in cohesion binding has been proven. Budding yeast lacks repetitive sequences and, therefore, requires a different mechanism for cohesion binding. Evidence suggests that binding of cohesin to the budding yeast centromere region depends on chromosome-associated proteins of the kinetochore that mediate cohesion association to pericentric regions.
Cohesin in sister chromatid cohesion
Mechanism of sister chromatid cohesion
It is not clear how the cohesin ring links sister chromatids together. There are two possible scenarios:- Cohesin subunits bind to each sister chromatid and form a bridge between the two.
- Since cohesin has a ring structure, it is able to encircle both sister chromatids.
Early studies suggested various ways in which cohesin may entrap DNA, including as a monomer that holds both homologues together, and a "hand-cuff" model where two intertwining cohesin complexes each hold one sister chromatid. While some studies support the idea of a hand-cuff model, the model is inconsistent with a number of experimental observations, and is generally considered to entrap chromatin as a monomer.
Even though the ring hypothesis appears to be valid, there are still questions about the number of rings required to hold sister chromatids together. One possibility is that one ring surrounds the two chromatids. Another possibility involves the creation of a dimer where each ring surrounds one sister chromatid. The two rings are connected to each other through formation of a bridge that holds the two sister chromatids together.
The topology and structure of these subunits has been best characterized in budding yeast, but the sequence conservation of these proteins and biochemical and electron microscopic observations imply that cohesin complexes in other species are very similar in their structure .
The cohesin complex is established during the initial stages of S-phase. The complexes associate with chromosomes before DNA replication occurs. Once cells start replicating their DNA, cohesin rings close and link the sister chromatids together. Cohesin complexes must be present during S-phase in order for cohesion to take place. It is unclear, however, how cohesin is loaded on the chromosomes during G1. There are two proposed hypotheses so far:
- The ATPase domain of the SMC proteins interacts with DNA and this interaction initially mediates the loading of cohesin complexes on chromosomes.
- Several proteins aid in the loading process. For example, Scc2 and Scc4 are both required for cohesin to load in budding yeast.