Show simple item record

Digitization of High-Stakes Exams

Empirical Insights and Design Recommendations for the Digital Execution and Scoring of Exams

dc.contributor.advisorSchumann, Matthias Prof. Dr.
dc.contributor.authorHartmann, Philipp
dc.date.accessioned2023-08-31T16:41:31Z
dc.date.available2023-09-07T00:50:11Z
dc.date.issued2023-08-31
dc.identifier.urihttp://resolver.sub.uni-goettingen.de/purl?ediss-11858/14861
dc.identifier.urihttp://dx.doi.org/10.53846/goediss-10080
dc.format.extent158de
dc.language.isoengde
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject.ddc330de
dc.titleDigitization of High-Stakes Examsde
dc.title.alternativeEmpirical Insights and Design Recommendations for the Digital Execution and Scoring of Examsde
dc.typecumulativeThesisde
dc.contributor.refereeSchumann, Matthias Prof. Dr.
dc.date.examination2023-08-29de
dc.description.abstractengThe opportunities for digitization in education have been addressed in research and practice for a long time. These extend across all components of the Curriculum-Instruction-Assessment (CIA) triad according to PELLEGRINO (2010). Thus, digitization is not only changing the required competencies of future workers, but also the way of teaching, learning, and testing. This development is further accelerated by the technological progress in the field of artificial intelligence (AI). In this context, both the STÄNDIGE KONFERENZ DER BILDUNGS- UND KULTUSMINISTER (2022) and the STÄNDIGE WISSENSCHAFTLICHE KOMMISSION (2022) point to the need for increased addressing of digital assessment. A literature analysis conducted as part of this dissertation shows that current research on digital exam execution often focuses on the usage perspective. Thus, primarily an isolated consideration of individual factors influencing the examinees (e.g., stress, familiarity, etc.) takes place. A comprehensive consideration of potential interrelationships between these factors is largely omitted. In the case of digital exam scoring, the use of AI is said to have a high potential in essay scoring. It is shown that currently only the scoring accuracy, but not the design of essay scoring systems, is addressed. This purely technical focus also means that the user perspective (e.g., trust) is not taken into account. Building on these findings, five studies on digital exam execution and scoring are conducted in this cumulative dissertation. In conjunction with the findings from the literature analysis, a total of 13 recommendations for practice were derived based on the results of these five studies. These show that examiners can address usage-oriented factors even before digital exams are conducted. This can reduce the influence of construct-irrelevant factors on test results and thus increase test quality. In the area of digital exam scoring, it is shown that despite technological advances, human scoring involvement can increase confidence in AI-based scorings. Based on these findings, specific design recommendations for semi-automatic AI-based scoring systems are derived. This simplifies the general transfer of technical research results on AI-based exam scoring into productive systems. Finally, further starting points for future research are derived. In particular, the development of large language models (LLM) is expected to have potential.de
dc.contributor.coRefereeSeeber, Susan Prof. Dr.
dc.contributor.thirdRefereeTrenz, Manuel Prof. Dr
dc.subject.engeducationde
dc.subject.engdigital assessmentde
dc.subject.enghigh-stakes examsde
dc.identifier.urnurn:nbn:de:gbv:7-ediss-14861-8
dc.affiliation.instituteWirtschaftswissenschaftliche Fakultätde
dc.subject.gokfullWirtschaftswissenschaften (PPN621567140)de
dc.description.embargoed2023-09-07de
dc.identifier.ppn1858637589
dc.notes.confirmationsentConfirmation sent 2023-08-31T19:45:01de


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record