WRKY protein domain
The WRKY domain is found in the WRKY transcription factor family, a class of transcription factors. The WRKY domain is found almost exclusively in plants although WRKY genes appear present in some diplomonads, social amoebae and other amoebozoa, and fungi incertae sedis. They appear absent in other non-plant species. WRKY transcription factors have been a significant area of plant research for the past 20 years. The WRKY DNA-binding domain recognizes the W-box TGAC cis-regulatory element.
Structure
WRKY transcription factors contain either one or two WRKY protein domains. The WRKY protein domain is 60 to 70 amino acids long type of DNA binding domain. The domain is characterized by a highly conserved core WRKYGQK motif and a zinc finger region. The cysteine and histidine zinc finger domain occurs as a CX4-5CX22-23HXH or CX7CX23HXC type, where X can be any amino acid. The zinc finger binds a Zn+2 ion, which is required for protein function. While the WRKYGQK is highly conserved in most WRKY domains, variation in the core sequence has been documented. A frequently occurring variant of the core sequence is WRKYGKK, which is present in most plant species.The structure of the WRKY protein domain was first determined in 2005 using nuclear magnetic resonance and later by crystallography. The WRKY protein domain is a globular shape composed of five anti-parallel β-strands. The core WRKYGQK motif is found on the second β-strand. Eighteen amino acids are highly conserved in the WRKY protein domain, including the core motif, zinc-finger binding cysteines and histidines, and a triad forming a DWK salt bridge. The triad consist of a conserved tryptophan of the core motif, along with an aspartic acid four amino acids upstream and a lysine 29 amino acids downstream of it, stabilizing the entire domain. Five amino acids on the third β-strand are also well conserved in the WRKY domain. Importantly, the WRKY genes contain a conserved intron in the WRKY domain, which occurs at the location encoding for the PR of the PRSYY amino acid sequence, thus explaining the conservation of this motif.