Structure of the Pol II initation complex with TFIIH and core Mediator and mechanistic implications for transcription
by Sandra Schilbach
Date of Examination:2018-02-19
Date of issue:2018-03-13
Advisor:Prof. Dr. Patrick Cramer
Referee:Prof. Dr. Patrick Cramer
Referee:Prof. Dr. Holger Stark
Referee:Prof. Dr. Marina Rodnina
Referee:Prof. Dr. Henning Urlaub
Referee:Prof. Dr. Ralf Ficner
Referee:Dr. Vladimir Pena
Files in this item
Name:Thesis_Schilbach_2017.pdf
Size:187.Mb
Format:PDF
Abstract
English
The transcription of protein coding genes is a key step in the conversion of genomic information into the content of a cell’s proteome. In eukaryotes, this process is performed by the multi-subunit DNA-dependent RNA polymerase (Pol) II and subject to intricate regulatory systems. During the consecutive phases of transcription, initiation, elongation and termination, Pol II associates with various factors that modulate its activity and thereby aid in the coordination of gene expression. Transcriptional control at the level of initiation is dominated by the general transcription factors (GTFs) TFIIA, TFIIB, TFIID, TFIIE, TFIIF and TFIIH. At the core-promoter Pol II assembles with the GTFs in an ordered manner to form the pre-initiation complex (PIC), which is competent to induce the opening of promoter DNA and to commence RNA synthesis. The subsequent transition from initiation to the elongation phase is preluded by poorly understood events that result in detachment of Pol II from the promoter (‘promoter escape’) and disassembly of the initiation machinery. Both promoter opening and escape mark pivotal initiation checkpoints that are susceptible to regulation, and for both processes an ATP-dependent involvement of the TFIIH complex has been demonstrated. TFIIH equals Pol II in size and intricacy, consists of two dissociable ‘core’ and ‘kinase’ modules, and comprises three subunits with conserved enzymatic function. Two of these, a DNA-translocase and a kinase, act at the distinct stages of initiation, respectively. In addition to the GTFs, promoter-located Pol II associates with the global coactivator Mediator. This multi-protein complex bridges between the initiation machinery and transcription factors bound at distant enhancer DNA elements and can adapt transcription in response to environmental and developmental stimuli. The ‘head’ and ‘middle’ module segments of Mediator (core Mediator, cMed) are essential and directly contact the initiation complex. Mediator promotes initiation by cooperatively enhancing the recruitment of Pol II and the GTFs to the core-promoter, stabilizing the assembled PIC and stimulating TFIIH kinase activity. Owing to the sheer size and complexity of Mediator and TFIIH, however, the underlying molecular mechanisms that govern the dynamic progression of the initiation machinery through the stages of PIC formation, promoter opening and promoter escape remain largely elusive. An extensive structural characterization of initiation complexes containing these factors may contribute to deduce their essential functions in detail but to date has been impeded by their flexibility and poor biochemical stability, as well as by the lack of highly pure samples for analysis. In an effort to overcome such limitations, the first protocol for the large-scale preparation of recombinant full-length TFIIH from the yeast Saccharomyces cerevisiae was established. This work describes effective co-expression approaches in Escherichia coli and insect cell systems that yielded various TFIIH subcomplexes. Using purified TFIIH core and kinase modules, the assembly of complete 10-subunit TFIIH was demonstrated. In addition, this work reports the in vitro reconstitution and cryo-electron microscopy (EM) analysis of the yeast PIC-cMed complex, a macromolecular 46-subunit assembly of ~2 MDa, which encompasses all initiation-related proteins essential for cell viability in yeast. Reconstructions of the TFIIH-comprising PIC and PIC-cMed complex were derived at nominal resolutions of 4.7 Å and 5.8 Å, respectively. The obtained cryo-EM maps reveal secondary structure throughout and provide the first visualization of TFIIH within the initiation machinery at high resolution. To facilitate model building and placement into the density, crosslinking experiments were conducted, which in particular aided to identify interactions between TFIIE and TFIIH. Based on previously determined models for the cMed complex and the PIC without TFIIH (core PIC, cPIC), on newly generated homology and ab initio models for multiple domains in TFIIE and TFIIH, and on de-novo built segments, the to date most complete and accurate structure of a transcription initiation complex was compiled. The remaining, unassigned protein sequences largely comprise regions that are predicted to be disordered and thus likely adopt flexible conformations within the structure. The PIC and PIC-cMed models reveal interactions between cMed, the cPIC and TFIIH, and in particular demonstrate how TFIIE anchors TFIIH to the cPIC. The subunit arrangement within the cPIC and cMed, as well as previously proposed interfaces between cMed, the Rpb4/7 stalk and the foot domain of Pol II, and TFIIB were confirmed and further explicated. Upon binding to the PIC, the cMed middle module undergoes significant rearrangements, which result in partial loss of its contacts to the head module and in ‘opening’ of the cMed structure. The previously observed ‘cradle’ that is formed by the cPIC and cMed and may accommodate the Cterminal domain (CTD) of Pol II is further defined by TFIIH. The core-TFIIH module emanates from the cPIC in a crescent-like shape, with its ATPases Ssl2 (XPB in human) and Rad3 (XPD) at the ends of the lobes and in proximity to the cPIC. Rad3 is anchored to the cPIC by a TFIIH kinase module subunit, Tfb3 (MAT1), and Ssl2 engages with downstream promoter DNA. The ATPases are connected via the scaffolding subunits Ssl1 (p44), Tfb2 (p52), Tfb4 (p34) and Tb5 (p8). The previously uncharacterized subunit Tfb1 (p62) serves as a flexible tether between various parts of core-TFIIH. Tfb3 directly contacts Pol II by binding in a groove between the Pol II stalk and the large TFIIE subunit Tfa1 (TFIIEα). To accommodate Tfb3, the stalk of Pol II is shifted in both structures. Additional interactions between TFIIH and the cPIC involve the Cterminal section of Tfa1. Several novel Tfa1 elements that bind the TFIIH subunits Tfb1 and Ssl2 were detected, thus indicating a mechanism for TFIIE-mediated TFIIH recruitment to the PIC. The structures also provide further information on the function of the catalytic TFIIH subunits during promoter opening and escape. The ATPase Ssl2, which is implicated in the unwinding of promoter DNA, is contacted by the newly assigned ‘E-bridge’ helix in Tfa1, suggesting a concept for TFIIE-stimulated DNA opening. Comparison of the Ssl2 ATPase domains to those of enzymes, which translocate on double-stranded DNA (dsDNA), revealed striking similarity and a respective Ssl2 translocation mechanism was proposed, consistent with biochemical evidence. The kinase module of TFIIH comprises the kinase Kin28 (CDK7), which phosphorylates the Pol II CTD and stimulates promoter escape. Except for the N-terminus of Tfb3, which anchors it to Rad3 and the cPIC, this module is flexible in the PIC structure. However, it adopts a preferred position outside of the cradle and in proximity to Mediator in the PIC-cMed complex. Moreover, it is located adjacently to one of two openings in the cradle that emerge after conformational rearrangement of cMed upon its PIC incorporation, thereby implying how Kin28 may access the CTD and how Mediator may enhance its kinase activity.
Keywords: transcription initiation; RNA polymerase II; Mediator; TFIIH; macromolecular complex; cryo-EM; XL-MS