Link prediction (LP) is the task of predicting new facts by reasoning
over a knowledge graph (KG). Various machine learning architectures
have been proposed for LP, many of them competing for better
performance on a few de facto benchmarks. This thesis characterizes LP
datasets with respect to their structural bias properties and the
effects of these biases on attained performance results.
We provide a domain-agnostic framework
that assesses the network topology, test leakage bias, and sample
selection bias in LP datasets. The framework includes SPARQL queries
that can be reused in exploratory data analysis of KGs to
uncover unusual patterns. Finally, we apply our framework to
characterize seven common benchmarks used for evaluating LP. In
our experiments, we use a trained TransE model to show how the two
bias types affect prediction results.
Our analysis shows problematic patterns in most of the benchmark
datasets. Especially critical are the findings regarding the
state-of-the-art benchmarks FB15k-237, WN18RR, and YAGO3-10.