Vorhersage, Analyse und Bedeutung charakteristischer Transkriptionsfaktor-Bindestellen und deren potentielle Masterkontrollfunktion in der transkriptionellen Genregulation
Prediction, analysis and significance of characteristic transcription factor binding sites and their potential master control function for the transcriptional gene regulation
von Martin Haubrock geb. Bennemann
Datum der mündl. Prüfung:2021-09-23
Erschienen:2021-10-19
Betreuer:Prof. Dr. Edgar Wingender
Gutachter:Prof. Dr. Edgar Wingender
Gutachter:Prof. Dr. Stephan Waack
Dateien
Name:Dissertation_Martin_Haubrock.pdf
Size:29.2Mb
Format:PDF
Description:Dissertation
Zusammenfassung
Englisch
In eukaryotes, transcriptional gene regulation is controlled by a set of transcription factors (TFs) and co-factors. Most TFs bind to specific DNA sequences. These binding sites are called transcription factor binding sites (TFBSs). A transcriptional regulatory region consists of different TFBSs that can be bound by a defined set of TFs. By protein interactions between the bound TFs and the accumulation of further TFs and co-factors, a regu- latory module is formed, which significantly influences the efficiency of transcription. Several of these modules are assembled at the actual transcription start site (TSS). A situation-specific activity of the transcription of a gene is mediated by a variable structure of these modules and their combination at the TSS. In this work, the characterizing property of a TFBS as a definitional core (seed) and its interacting TFs as master regulators of a potential regulatory module are investigated. Based on known binding site motifs (positional weighting matrices, PWMs) different analysis strategies were developed to determine seeds in various kinds of genomic sequence data. The first implemented method analyzes core binding sites for known sequence variations in genomic sequences. The second method determines seeds based on phylogenetic similarity using sequence alignments and comparative annotation. The third methodology determines defined seed binding sites in genome-wide regulatory sequence data. The importance of the core binding sites was investigated in this work in four sub-projects. Through these analyses, various properties of the module-forming TFBSs can be determined. In the first study, based on pathogenic human sequence variations, it is shown that a disease-related change in a TFBS is associated with a change in the TFs being bound at that site. This property has been proven in two different clinical studies. Seed binding sites can be used to create regulatory transcription networks (RTNs) and was investigated in the second sub-project. An RTN consists of seed binding sites and their interacting TFs. A comparison of different human tissue-specific RTNs reveals a common design principle based on the node degree distributions, although the individual seed binding sites are used in a very tissue-specific manner. The third sub-project demonstrates the importance of seeds and their active master regulators (TFs). For experimental regions actively bound by a TF, the responsible seed binding sites are shown to be of comparable quality in comparison to different cell types. This suggests that module-forming TFBSs and their interacting master regulators exhibit universal bind- ing site qualities. In the last sub-project, the special control function of the sequence-defined core binding sites in different regulatory regions is investigated. The study distinguishes between regulatory regions close to the TSS (promoters) and more distant regions (enhancers). Overall, a specific control function of sequence- defined seeds in both promoters and enhancers can be demonstrated. Furthermore, it can be shown that defined enhancer-promoter interactions are mediated by these core binding sites and their interacting master regulators directly or indirectly, based on co-factors.
Keywords: Transcription factor binding site; Transcription factor; Transcriptional gene regulation; Enhancer; Promoter