Bias Assessments of Benchmarks for Link Predictions over Knowledge Graphs

Show simple item record

dc.identifier.uri https://www.repo.uni-hannover.de/handle/123456789/13710
dc.identifier.uri https://doi.org/10.15488/13600
dc.contributor.advisor Russo, Mayra
dc.contributor.advisor Vidal, Maria-Esther
dc.contributor.author Sawischa, Sammy Fabian eng
dc.contributor.other TIB (Technische Informationsbibliothek)
dc.date.accessioned 2023-05-08T08:20:48Z
dc.date.available 2023-05-08T08:20:48Z
dc.date.issued 2023-04-26
dc.identifier.citation Sawischa, Sammy Fabian: Bias Assessments of Benchmarks for Link Predictions over Knowledge Graphs. Hannover : Gottfried Wilhelm Leibniz Universität, Bachelor Thesis, 2023, X, 59 S. DOI: https://doi.org/10.15488/13600 eng
dc.description.abstract Link prediction (LP) aims to tackle the challenge of predicting new facts by reasoning over a knowledge graph (KG). Different machine learning architectures have been proposed to solve the task of LP, several of them competing for better performance on a few de-facto benchmarks. The problem of this thesis is the characterization of LP datasets regarding their structural bias properties and their effects on attained performance results. We provide a domain-agnostic framework that assesses the network topology, test leakage bias and sample selection bias in LP datasets. The framework includes SPARQL queries that can be reused in the explorative data analysis of KGs for uncovering unusual patterns. We finally apply our framework to characterize 7 common benchmarks used for assessing the task of LP. In conducted experiments, we use a trained TransE model to show how the two bias types affect prediction results. Our analysis shows problematic patterns in most of the benchmark datasets. Especially critical are the findings regarding the state-of-the-art benchmarks FB15k-237, WN18RR and YAGO3-10. eng
dc.language.iso eng eng
dc.publisher Hannover : Gottfried Wilhelm Leibniz Universität
dc.rights CC BY 3.0 DE eng
dc.rights.uri http://creativecommons.org/licenses/by/3.0/de/ eng
dc.subject Link prediction eng
dc.subject Benchmarks eng
dc.subject Knowledge graphs eng
dc.subject Sample selection bias eng
dc.subject Test leakage bias eng
dc.subject Machine Learning eng
dc.subject Link-Vorhersagen ger
dc.subject Benchmarks ger
dc.subject Wissensgraphen ger
dc.subject Stichprobenverzerrung ger
dc.subject Informationsleck ger
dc.subject Maschinelles Lernen ger
dc.subject.ddc 004 | Informatik eng
dc.title Bias Assessments of Benchmarks for Link Predictions over Knowledge Graphs eng
dc.type BachelorThesis eng
dc.type Text eng
dcterms.extent X, 59 S. eng
dc.description.version publishedVersion eng
tib.accessRights frei zug�nglich eng


Files in this item

This item appears in the following Collection(s):

Show simple item record

 

Search the repository


Browse

My Account

Usage Statistics