• Deutsch
    • English
  • English 
    • Deutsch
    • English
  • Login
Item View 
  •   Home
  • Naturwissenschaften, Mathematik und Informatik
  • Fakultät für Biologie und Psychologie (inkl. GAUSS)
  • Item View
  •   Home
  • Naturwissenschaften, Mathematik und Informatik
  • Fakultät für Biologie und Psychologie (inkl. GAUSS)
  • Item View
JavaScript is disabled for your browser. Some features of this site may not work without it.

Proteogenomic peptide discovery in large search spaces

by Yehor Horokhovskyi
Doctoral thesis
Date of Examination:2024-02-23
Date of issue:2025-01-30
Advisor:Dr. Juliane Liepe
Referee:Prof. Dr. Henning Urlaub
Referee:Dr. Johannes Soeding
Referee:Prof. Dr. Anne-Christin Hauschild
Referee:Prof. Dr. Jochen Rink
Referee:Dr. Alex Faesen
crossref-logoPersistent Address: http://dx.doi.org/10.53846/goediss-11048

 

 

Files in this item

Name:Dissertation_YH_2025.pdf
Size:5.73Mb
Format:PDF
Description:Dissertation
ViewOpen

The following license files are associated with this item:


Abstract

English

Novel peptide and protein discovery is of therapeutical relevance and ignited the development of methods for their identification, commonly done via mass spectrometry. The latter relies on a well-characterised proteogenomic search space, whose size is not known, especially when considering noncanonical peptides that are derived from alternative transcription and translation events. The sequence content of proteogenomic search spaces acts as an informed prior about sample composition, but if the prior assumptions are incorrect, peptide and protein identification will be compromised. We developed an automated workflow consisting of Sequoia for the creation of RNA sequencing informed and exhaustive sequence search spaces for various noncanonical peptide strata, and SPIsnake for pre-filtering and exploration of sequence search space prior to mass spectrometry searches. We applied our automated workflow to characterise the exact sizes of tryptic and nonspecific peptide sequence search spaces in a variety of definitions, their reduction when using RNA expression, their inflation by chemical post-translational modifications, and the frequency of peptide sequence multimapping to different noncanonical origins. Furthermore, we explored the application of Sequoia and SPIsnake on HLA-I immunopeptidome sequence identification, allowing us to rescue sensitivity in peptide identification when confronted with inflated search spaces. Taken together, Sequoia and SPIsnake pave the way for an educated development of methods addressing large-scale exhaustive proteogenomic discovery by exposing the consequences of database size inflation and ambiguity of peptide and protein sequence identification.
Keywords: Mass-spectrometry, Proteomics, Proteogenomics, MHC-I, Immunopeptidome
 

Statistik

Publish here

Browse

All of eDissFaculties & ProgramsIssue DateAuthorAdvisor & RefereeAdvisorRefereeTitlesTypeThis FacultyIssue DateAuthorAdvisor & RefereeAdvisorRefereeTitlesType

Help & Info

Publishing on eDissPDF GuideTerms of ContractFAQ

Contact Us | Impressum | Cookie Consents | Data Protection Information | Accessibility
eDiss Office - SUB Göttingen (Central Library)
Platz der Göttinger Sieben 1
Mo - Fr 10:00 – 12:00 h


Tel.: +49 (0)551 39-27809 (general inquiries)
Tel.: +49 (0)551 39-28655 (open access/parallel publications)
ediss_AT_sub.uni-goettingen.de
[Please replace "_AT_" with the "@" sign when using our email adresses.]
Göttingen State and University Library | Göttingen University
Medicine Library (Doctoral candidates of medicine only)
Robert-Koch-Str. 40
Mon – Fri 8:00 – 24:00 h
Sat - Sun 8:00 – 22:00 h
Holidays 10:00 – 20:00 h
Tel.: +49 551 39-8395 (general inquiries)
Tel.: +49 (0)551 39-28655 (open access/parallel publications)
bbmed_AT_sub.uni-goettingen.de
[Please replace "_AT_" with the "@" sign when using our email adresses.]